PRODUCTS COMPRISING SUBSTRATES CAPABLE
OF ENZYMATIC CROSS-LINKING

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of 
application serial no. 08/205,518, filed March 3, 1994.

INTRODUCTION 



Technical Field 

The field of this invention is polymeric materials
having multiple sequences which are capable of covalent
cross-linking by enzymatic reaction, particularly as
implantable resorbable protein polymers.

Background 

Enzymes are biological catalysts that accelerate
chemical reactions. These chemical reactions fall into many
classes. Of particular interest are enzymes whose substrates
are proteins, where the enzyme catalyzes the covalent
cross-linking of other compounds to the proteins. Striking
characteristics of all enzymes are their catalytic power and
specificity. Enzymes accelerate reactions by factors of at
least a million. They are highly specific both in the
reaction catalyzed and in their choice of reactants, called
substrates. Enzyme catalysis allows reactions to occur under
physiological conditions.

There are numerous examples of enzymes which have been
modified to allow them to operate under more extreme
conditions, such as low pH or high temperature, to develop
more useful products. There are enzymes which have been
modified and shown to have different activities, including
their substrate specificity. A wide variety of conditions
have been shown to modify the catalytic activity of enzymes,
such as the use of non-polar solvents. However, there have
been relatively few attempts to produce substrates not found
in nature for enzymes where the substrate will be covalently
coupled to another compound, providing for products of
substantial utility. In particular, the use of non-natural
substrates containing multiple sites for enzymatic
cross-linking in order to modify selected properties of natural
substrate/enzyme reaction products is not believed to have
been previously demonstrated. Nevertheless, for particular
product applications there are deficiencies in the
performance of natural substrate/enzyme systems which make
it desirable to produce such substrates.

Peptide synthetases, acyl transferases, glycosyl
transferases, and phosphotransferases have varying degrees of
specificity as to their ability to form covalent bonds
between two molecules. To the extent that the enzyme is not
too fastidious in its substrate, polyfunctional molecules
may be employed which cross-link to form structurally strong
products, which may serve to bond or cement various parts or
constituents of a body or organism. Thus, by having a
polyfunctional polymer with cross-linking elements, where
the polymer is adherent to the parts or constituents, by
cross-linking the polymer, the parts or constituents may be
bonded together.

A compelling example is in the production of an
adhesive to bond separated tissues. Sutures and staples are
effective and well established wound closure devices.
However, there are surgical procedures where classical
repair procedures are unsatisfactory, limited to highly
trained specialists (e.g. microsurgery), or not applicable
due to tissue or organ fragility, inaccessibility (e.g.
endoscopy procedures), or fluid loss, including capillary
"weeping". Tissue adhesives and sealants have been developed
to meet these needs. They may be used to seal or reinforce
wounds that have been sutured or stapled, as well as finding
independent use. The leading commercial products are fibrin
glues and cyanoacrylates. However, both products have
significant limitations which have prevented their
widespread use.

Cyanoacrylates are mainly used for cutaneous wound
closure in facial and reconstructive surgery. The appeal of
cyanoacrylates is their speed of bonding, which is almost
immediate, and their great bond strength. However, their speed
of bonding can be a disadvantage, since glued tissue must be
cut again in order to reshape it to approximate its original
conformation. Additionally, they can only be used on dry
substrates since their mode of action is through a mechanical
interlock, and they are relatively inflexible compared to
surrounding tissue. Cyanoacrylates are also known to be
toxic to some tissues and, although they are not considered to
be biodegradable, potential degradation products are
suspected to be carcinogenic.

Fibrin glues comprising blood-derived fibrinogen and
thrombin function primarily as a sealant and hemostat and
have been used in many different surgical procedures within
the body. They have been shown to be non-toxic,
biocompatible and biodegradable. They are able to control
excessive bleeding and decrease fibrosis. However, tissues
bonded with fibrin cannot be subjected to even moderate
tensile stress without rupturing the bond. It takes 3-10
minutes for an initial bond to develop, but requires 30
minutes to several hours for full strength to develop.
Depending upon the application, the product may also resorb
too quickly. Fibrin glues derived from heterologous human
and animal sera may provoke undesirable immune responses,
and expose the patient to the potential risk of viral
infection. Autologous fibrin glues may be impractical to
obtain and use and may compromise patient safety.

There is, therefore, substantial interest in developing
products which have the biocompatibility of fibrin glues but
which set more quickly and have enhanced strength. A product
having such attributes which is also not derived from blood
or animal sources is also of interest.

Relevant Literature

Enzymes and their function are described in Enzymes,
3rd Edition, Dixon, M. and Webb, E.C., eds., Academic Press,
NY 1979; and The Enzymes, 3rd edition, Boyer, P.D., Academic
Press, NY 1970-. Tissue adhesives are described in Tissue
Adhesives in Surgery, Matsumoto, T., Medical Examination
Publishing Co., Inc., 1972 and Sierra, D.H., J. Biomat. App.
7:309-352, 1993. Methods of preparation of protein polymers
having blocks of repetitive units are described in
U.S. Patent No. 5,243,038 and EPA 89.913054.3.

SUMMARY OF THE INVENTION 
Polymeric compositions and methods for their use are
provided, where the polymeric compositions are capable of
enzyme catalyzed reaction involving covalent, normally
peptide bond, cross-linking. Polyfunctional polymers are
employed, where the polymers, or the polymer(s) in conjunction
with low molecular weight polyfunctional cross-linking
agents, are combined with an enzyme to provide for
cross-linking with increase in tensile properties. The
compositions find particular application for use in
biological systems, where in situ formation of a
biocompatible material having structural integrity is
desired. For applications within the body, the materials may
be subject to resorption over a predetermined time period,
particularly by modifying their susceptibility to the action
of specific proteases. Preferred compositions comprise a
plurality of amino acid sequences which are capable of
transamidase reaction, such as catalyzed by factor XIIIa, to
form an isopeptide bond, where the sequence is a side chain
of a polymeric backbone or is part of the backbone. The
compositions find use as medical adhesives and sealants, in
the closure of wounds and repair of damaged tissues,
prosthesis coatings, drug depots, matrices for the
transplantation of cells and the like.

Alternatively, the compositions may be used as
substrates for enzyme catalyzed reaction to bind various
agents to a site comprising the subject inventions, or the
like.


DESCRIPTION OF THE SPECIFIC EMBODIMENTS
Compositions and their uses are provided, where the
compositions are comprised of high molecular weight
polymeric compositions comprising a plurality of sequences
serving as substrates for enzymatic cross-linking. By
employing enzymes such as transferases and synthetases,
which can act on polyfunctional substrates, so as to form
covalent bonds at recognition sites, a cross-linked
structure can be produced, having enhanced tensile
properties. The recognition sites may comprise naturally
occurring or mutated consensus sequences or in some
instances a single reactive amino acid or functional group.
By employing polymers having good adhesive properties,
particularly after cross-linking, the resulting cross-linked
product finds use in bonding parts together, such as body
parts or parts of an organism.

Synthetic proteins may be prepared which have amino
acids which are susceptible to enzyme catalyzed
cross-linking. By appropriate selection of enzyme recognition
sites, the synthetic protein may be cross-linked by itself
or in conjunction with small polyfunctional molecules or
polymers. Illustrative of such polymers are polymers
comprising recognition sites which can result in peptide
cross-links, particularly as substrates for
transglutaminases, such as plasma factor XIIIa, or modified
forms of these natural enzymes. The sequences may be the
naturally occurring sequences or modified sequences,
particularly where the glutamine or lysine involved with
isopeptide bond formation or amino acids surrounding them
are substituted by different amino acids in order to
modulate the enzyme catalyzed reaction of the lysine or
glutamine.

WO 95/23611                                   PCT/US95/02728

Alternatively, synthetic peptides conferring the
activities described above may be covalently conjugated to
high molecular weight carriers in order to promote the
intermolecular cross-linking of the carrier molecules. An
adhesive product composed of such peptide conjugated
carriers will attain its setting speed and bonding strength
on the basis of the cross-linking chemistry and the adhesive
properties supplied by the peptide sequences, as well as the
intrinsic properties of the polymer. However, additional
beneficial properties may also be provided by the carrier
molecules such as additional reactivity, solubility,
viscoelasticity, adhesivity, biocompatibility,
bioresorption, and biofunctionality. Examples of carrier
molecules which may be used are proteins such as collagen,
fibrinogen, casein, keratin, and their derivatives;
polysaccharides such as hyaluronic acid, chitosan, heparin,
glycosaminoglycans, dextran, cellulose and their
derivatives; and synthetic polymers such as polyethylene
glycol, polyvinyl alcohol, polyesters, polyacrylates, and
their derivatives. Depending on the purpose of the product
and the context in which it is used, the composition may be
physiologically compatible or not. The primary application
will be in conjunction with physiological uses.

Enzymes of interest are those that have an amino acid
as one of their substrates. The enzymes may react with a
particular amino acid, where there is no specific
requirement for flanking amino acid sequences, or they may
require that the reactive amino acid be positioned within a
particular amino acid sequence. The other substrate for the
enzymes can fall into a variety of different classes,
including: an amino acid, individually or as part of a
peptide or protein; specific chemical groups, such as
phosphates and sulphates, whether as part of a larger
molecule or not; monosaccharides, disaccharides, and
polysaccharides; and lipids and fatty acids. The common
element defining the reaction which occurs between the
substrates is that a covalent bond is established, forming
a larger molecule than either of the individual substrates,
normally involving the formation of amides, esters, ethers
and alkylation of amines. Therefore, the cross-linking will
normally be other than the formation of a disulfide bond
between two cysteines and will usually involve a carbon
atom, particularly an oxo or oxy group, more particularly,
a carboxy group, where esters and amides are formed. As
indicated above, in some instances with inorganic acid
groups, other than carboxy ester or amide formation may be
involved, inorganic esters, amides or anhydrides being
formed.

Examples of such enzymes include: lysyl oxidase, which
initiates the formation of a covalent bond between lysine
residues on adjacent protein molecules, used in the
cross-linking of elastin and also collagen, where one of the
lysines is oxidized to act as an aldehyde group for
formation of an imine; phosphorylases, such as cellular
phosphorylase A or B, which couple phosphate and serine;
glycosylases, which typically couple mono- or disaccharides
to proteins through the amino acids serine or asparagine,
and the enzymes which are responsible for the coupling of
polysaccharides and proteins in the formation of
glycosaminoglycans; fatty acyltransferases, which are
involved in the coupling of lipids and fatty acids onto
proteins; and transamidases, particularly transglutaminases,
which couple amino acids through the formation of an
isopeptide bond, particularly the amino acids glutamine and
lysine.

As is common among the different categories of enzymes,
there may be many different enzymes which catalyze the same
basic chemical reaction but which have different catalytic
rates or substrate specificities. For example, among the
transglutaminases, there are liver, muscle, epithelial (or
"tissue"), and keratinocyte transglutaminases, which among
them include different specificities for the amino acids
flanking the glutamine residue. All use lysine as the
complementary substrate.

Factor XIIIa is a transglutaminase which forms a
covalent isopeptide linkage between an available lysine and
a specific glutamine within a defined peptide consensus
sequence of the fibrin gamma chain, where individual fibrin
chains are held together as a complex with other subunits of
fibrinogen. This provides a mechanism for covalent
attachment between fibrin chains and from fibrin chains to
extracellular matrix proteins (e.g. collagen, fibronectin)
and is the chemistry underlying current fibrin-based glues.
There are two distinct factor XIIIa species, one derived
from plasma and one derived from placenta, both having
equivalent activity for fibrin. Human fibrin has two
identical sites which are covalently cross-linked by
activated factor XIII (factor XIIIa) to form an adhesive
fibrin matrix. Fibrin glue was used as a model for
demonstrating that substrates containing multiple sites for
enzymatic cross-linking can modify selected properties of
natural substrate/enzyme reaction products.

Exemplary of the use of physiologically acceptable
polymers is the use of peptide sequences which serve as
recognition sites for isopeptide formation. These subject
polymeric compounds may comprise a plurality of naturally
occurring complete or minimal consensus sequences and/or
mutated sequences for enzyme catalyzed isopeptide formation.
The subject polymeric compositions may be divided into a
number of categories: (1) the isopeptide substrate sequence,
e.g. factor XIIIa ("FXIIIa") substrate sequence, is part of
a synthetic protein sequence having at least two isopeptide
substrate sequences in the chain of the polymer, where the
isopeptide substrate sequence is (a) the natural substrate
sequence of an enzyme; or (b) a mutated sequence; or (2) the
isopeptide substrate sequence is a side chain of a polymer,
where the isopeptide substrate sequence is (a) the natural
substrate sequence; or (b) a mutated sequence.

Of particular interest are synthetic peptides that can
serve as factor XIIIa substrates, where the peptides or
conjugation products of high molecular weight polymers
containing reactive amino groups and pendent factor XIIIa
substrate moieties can be covalently cross-linked via the
action of activated factor XIII, a ubiquitous plasma
clotting enzyme. Conjugates can be produced via a multitude
of acceptable peptide conjugation chemistries as long as the
chemistry maintains an accessible and reactive glutamine
and/or lysine residue in the peptide. The active amino
group can be supplied on the same peptide (via a lysine
residue or any aliphatic amine that may serve as a lysine
substitute), allowing a single conjugated species to form the
cross-linked adhesive matrix, or on a separate peptide or
compound, allowing for a mixture of the two molecules to
provide adhesive cross-linking. The mixture of two
compounds allows the control and adjustment of the
concentrations of each species, which may be advantageous in
optimization of adhesive properties.

For the desired degree of cross-linking, there will be
a spacer or intervening sequence of at least about 25 amino
acids, or their atom equivalent with non-peptidic polymers,
between the same reactive amino acids, usually between
reactive amino acids. By way of illustration, in referring
to the same amino acids, this would intend glutamine capable
of enzyme catalyzed cross-linking, while in referring to
reactive amino acids, this would intend both glutamine and
lysine capable of enzyme catalyzed cross-linking. This
spacer will be present where the reactive amino acid is part
of the polymeric chain or is part of a pendent sequence.
Desirably there will be at least about 30 amino acids
between the same reactive amino acids, more preferably at
least about 40 amino acids. The intervening amino acids
will include amino acids as part of the consensus sequence,
as appropriate, and other amino acids which may play a
variety of roles.
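
As a rough illustration of the spacing rule above, the following
Python sketch counts the residues lying between successive
glutamines in a one-letter protein sequence and reports whether
every gap meets a chosen minimum. The function names and the test
sequence are hypothetical, and the sketch assumes for simplicity
that every glutamine is a reactive site, without regard to whether
it lies within a consensus sequence.

```python
def reactive_gaps(sequence, residue="Q"):
    """Return the number of residues between successive occurrences of `residue`."""
    positions = [i for i, aa in enumerate(sequence) if aa == residue]
    return [b - a - 1 for a, b in zip(positions, positions[1:])]


def meets_spacing(sequence, minimum=25, residue="Q"):
    """True if every gap between successive `residue` sites is at least `minimum`."""
    return all(gap >= minimum for gap in reactive_gaps(sequence, residue))


# Hypothetical sequence: two glutamines separated by 30 Gly/Ala residues.
example = "Q" + "GA" * 15 + "Q"
print(reactive_gaps(example))       # [30]
print(meets_spacing(example, 25))   # True  (meets the "at least about 25" spacing)
print(meets_spacing(example, 40))   # False (short of the preferred 40)
```

The default of 25 corresponds to the minimum spacer stated above;
passing 30 or 40 checks the more preferred spacings.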

The polymers will be at least 15 kD, generally at least
about 35 kD, more usually at least about 50 kD and generally
not above 250 kD, usually not above about 125 kD in
molecular weight.
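
To relate these molecular weights to chain length, the following
Python sketch estimates the mass of a protein polymer from its
residue count and tests it against the stated limits. The average
residue mass of 110 D is a common rule of thumb and is an assumption
here, not a value taken from this specification; the function names
are likewise hypothetical.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # assumption: rule-of-thumb average residue mass


def approx_mass_kd(n_residues):
    """Estimate polymer mass in kD from the number of amino acid residues."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


def within_limits(mass_kd, low=15.0, high=250.0):
    """True if the mass lies within the broadest range stated in the text."""
    return low <= mass_kd <= high


print(approx_mass_kd(500))                  # 55.0
print(within_limits(approx_mass_kd(500)))   # True
print(within_limits(approx_mass_kd(100)))   # False (about 11 kD, below 15 kD)
```

Passing low=50.0 and high=125.0 applies the narrower, more usual
range instead.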

The protein polymer may be a sequence which has as its
only repetitive motif the consensus sequence or may have
repetitive domains, which comprise the reactive amino acid
and an intervening sequence. For the most part, the protein
polymer will have repetitive blocks, where the blocks may be
the same or different, there usually being not more than
about 3 different blocks. The blocks will be at least about
10 amino acids, usually at least 20 amino acids, more
usually at least about 40 amino acids, preferably at least
about 50 amino acids, and may be at least about 65 amino
acids or more, usually not more than about 200 amino acids.

The amino acids between the reactive amino acids may
play a passive role in providing for the molecular weight of
the protein, without introducing undesirable properties, or
may play an active role, in providing for particular
structural properties, e.g. tensile properties,
conformation, hydrophilic/hydrophobic regions, adhesion
properties, specific binding properties, e.g. cell binding
or basement membrane binding properties, or the like,
depending upon the intended use of the protein. Undesirable
properties could be immunogenicity, proteolytic
susceptibility, inflammatory activity, etc., when used in
vivo, insolubility when used in vitro, and the like.

The intervening sequence may include a wide variety of
functional peptide sequences, such as the fibronectin
binding site (RGD), the laminin binding site, the fibrin
gamma polsite, lipid or saccharide binding recognition
sites, protease cleavage recognition sites, adhesion
molecule recognition sites, e.g. lectin sites, and the like.

Serving as intervening sequences may be all or a
portion of the protein sequence in which the reactive amino
acid is found, combinations of such sequence with other
sequences which provide particular properties, relatively
random sequences, which may provide generally hydrophobic
and/or hydrophilic properties, repetitive sequences of from
about 3 to 30 amino acids, particularly naturally occurring
sequences of from about 3 to 18 amino acids, where the
repetition may be based on a motif, such as in collagen,
rather than on the identical sequence being repeated, or
combinations thereof.

Of particular interest as the intervening sequence is
a portion of the protein in which the reactive amino acid
consensus sequence is found, more particularly the sequence
proximal to the consensus sequence, the N-sequence, the
C-sequence or both. For example, in the case of isopeptide
formation using Factor XIII, a sequence from fibrinogen,
preferably from the fibrin sequence, or a sequence from
casein, particularly the sequence proximal to the consensus
sequence, may be employed. Thus, one would have the
naturally occurring sequence or fragment thereof repeated as
a block polymer. Where one uses the naturally occurring
sequence or mutated sequence thereof, there will usually be
at least 10 amino acids from the natural protein, more
usually at least about 15 amino acids, and usually not more
than about 125 amino acids, more usually not more than about
100 amino acids. The number of mutations will usually be
fewer than 20 number %, more usually fewer than 10 number %,
and conveniently fewer than 5 number %, generally being in
the range of about 0 to 10 mutations, where the mutations
may be deletions, insertions, transversions and transitions.
One may have, as already indicated above, other sequences to
provide other functions to the protein polymer, where the
sequences may be tandem or internal to the natural sequence.

Alternatively, the protein polymer may involve a
repetitive sequence comprising relatively small units of
from 3 to about 30 amino acids, particularly derived from a
naturally occurring sequence. With reactive amino acid
containing sequences or block copolymers, the repetitive
units will generally be present in from about 2 to 30,
usually 2 to 15, more usually 2 to 12, tandem units,
depending on the number of amino acids in each unit, the
desired length of the repetitive unit intervening sequence,
the desired properties of the protein polymer, and the like.

The protein polymer comprised of repetitive units will
have a repetitive unit of from about 2 to 200, often from
about 2 to 100, more often from about 3 to 30, usually 3 to
15 amino acids, more usually 3 to 12 amino acids, and
particularly 3 to 8 amino acids, where the repetitive unit
will normally be related to a naturally occurring repetitive
unit. Naturally occurring proteins which have repetitive
units as the main component of their structure, where the
repetitive unit may differ as to some of the amino acids,
but will have a motif which results in a particular
structure or conformation, include collagen, where the motif
requires every third amino acid to be glycine and that there
be a relatively high proportion of proline at the remaining
two sites, generally between about 10 to 45% of the total
amino acids present; elastin (VPGVG) (SEQ ID NO:01); fibroin
(GAGAGS) (SEQ ID NO:02); keratin (KLK/ELAEA) (SEQ ID NO:03);
or the like. Depending on the application, the polymer is
desirably biocompatible, particularly resorbable.
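
The collagen motif described above can be expressed as a simple
test. The Python sketch below checks whether glycine occupies every
third position of a sequence and whether proline makes up about 10
to 45% of the residues. The function names and the collagen-like
test sequence are hypothetical and are given only to illustrate the
stated motif; the elastin repeat is taken from the text.

```python
def glycine_every_third(sequence):
    """True if glycine fills every third position in at least one reading frame."""
    return any(
        all(aa == "G" for aa in sequence[offset::3])
        for offset in range(3)
        if sequence[offset::3]
    )


def proline_fraction(sequence):
    """Fraction of residues that are proline (0.0 for an empty sequence)."""
    return sequence.count("P") / len(sequence) if sequence else 0.0


def looks_collagen_like(sequence, low=0.10, high=0.45):
    """Apply both parts of the motif: Gly at every third site, Pro within range."""
    return glycine_every_third(sequence) and low <= proline_fraction(sequence) <= high


collagen_like = "GAPGPAGSP"      # hypothetical Gly-X-Y repeats, 3 of 9 residues Pro
elastin_like = "VPGVG" * 3       # elastin repeat unit from the text

print(looks_collagen_like(collagen_like))  # True
print(looks_collagen_like(elastin_like))   # False
```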

The repetitive units may be homopolymer units,
alternating repetitive units, or block copolymer units,
where a block has at least two repetitive units, or
combinations thereof. If desired, different repetitive
units may be present between the different recognition sites
along the protein chain. Usually the intervening repetitive
units will involve at least 2 repetitive units, frequently
at least 3 repetitive units and may be 60 or more repetitive
units, where the average number of repetitive units will
usually be at least about 2, more usually at least about 3
repetitive units, often at least about 5 repetitive units,
and not more than about 60, more usually not more than about
30 repetitive units. By varying the selection of repetitive
units, the size of the blocks, and the frequency and spacing
of the consensus sequences, the physical, chemical and
biological properties can be greatly varied.

The protein polymer may have varying sequences for
serving particular functions, such as separation,
purification, chelation and the like. For example, one can
have a string of at least 4 histidines, usually not more
than about 12 histidines, which will serve to chelate metal
ions to allow for separation and purification, as well as
ease of identification.

Instead of having a protein polymer with the reactive
amino acid in the polymer chain, one may link an
oligopeptide comprising the reactive amino acid to a
polymeric backbone. A high molecular weight carrier molecule
of particular use is collagen, especially soluble
atelopeptide collagen. Active peptides may be conjugated
under conditions in which at least two reactive glutamine
containing peptides are conjugated per collagen molecule.
More preferably, 4 or more peptides may be conjugated to a
single collagen molecule. The more cross-links forming to
create the final adhesive, the greater the cohesive strength
of the adhesive. The greater the concentration of potential
cross-link sites, the more rapidly the adhesive will set.
Either property will yield a product of improved utility.
Unconjugated or partially conjugated collagen provides ample
lysine residues to serve as reactive amines. Therefore, a
mixture of fully or partially conjugated peptide collagen
and unconjugated collagen could serve as a useful adhesive
substrate for factor XIIIa. Native collagen by itself is a
poor substrate for factor XIIIa because it has no reactive
glutamines. However, body tissues containing collagen and
other proteins contain many lysines which may participate
indiscriminately for cross-linking with the conjugate.
Production of an adhesive mixture which has a stoichiometric
excess of glutamine cross-linking sites over reactive amino
groups will promote the cross-linking of the conjugate with
the body tissue to which it is applied. This will promote
a stronger adhesive bond.

One limitation with the use of collagen or other native
proteins for conjugation with active peptides is that they
are potentially animal derived and their properties may not
suit the formulation needs of the product. For example, the
solubility of atelocollagen is approximately 25 mg/ml in
aqueous solution. If it becomes denatured during processing
or formulation, its solubility falls to 15 mg/ml. Above
these concentrations, collagen solutions are highly viscous
suspensions or gels that are extremely difficult to mix and
will prevent the diffusion of crosslinking agents such as
factor XIII. These limitations make native collagen
impractical for use if the adhesive performance requires
that the protein substrate concentration exceed 25 mg/ml.
It is easy to presume that a protein concentration in excess
of 25 mg/ml might be required considering that the protein
substrate concentration of fibrin glue can be as high as 110
mg/ml (Immuno AG, Tisseel product specification).

These limitations can be overcome by the use of
synthetically designed and recombinantly produced protein
polymers for conjugation. Of particular use are those
protein polymers as described above which are high in
molecular weight, contain amino acids whose side chains lend
themselves to convenient and controlled chemical
modification such as lysine, cysteine, and glutamic acid,
and are soluble in aqueous solution at concentrations of 50
mg/ml or greater.

The subject compositions may comprise one or more
compounds, determined in part by whether a single compound
can provide both the carboxy donor and amino donor entities,
e.g. amino acids. Depending on the enzyme, the recognition
sequence may be severely restricted or allow for various
degrees of substitution, without significantly adversely
affecting the binding affinity of the substrate sequence to
the enzyme and the rate of the enzyme catalyzed reaction.
The composition will include at least one polymeric compound
of at least 15 kD comprising at least two recognition
sequences providing the same reactive functionality, with
the reactive amino acid in the polymer chain or as a side
group or the combination thereof. Where only one compound
is present, the compound will have a plurality of
recognition sequences which provide the carboxy and amino
reactive amino acids. Where there is more than one
compound, the same or different compounds may provide one or
the other of the reactive amino acids. Conveniently, there
will be one or two polymeric compounds, associated with one
or two small molecules which have at least two reactive
amino acids, to serve as cross-linkers of the polymeric
compound(s). The small cross-linking molecules will usually
be at least about 150 D, usually at least about 200 D and
not more than about 10 kD, usually not more than about 5 kD
in molecular weight.

The isopeptide substrate sequence may be any
recognition site, comprising one or a plurality of amino
acids, e.g. a single amino acid or an amino acid sequence of
3 or more amino acids, recognized by an enzyme in producing
an isopeptide bond between an amino group and a carboxy
group. For the most part, the reaction will involve at least
one naturally occurring or mutated consensus sequence,
particularly involving the carboxy donor amino acid.
Therefore, for the most part, the reaction will involve a
carboxy donor amino acid, e.g. the carboxamide of glutamine,
in a consensus sequence and another molecule having an amino
group, particularly an amino acid, which may be a diamine or
polyamino compound having from about 2 to 5 amino groups,
conveniently an oligopeptide, or part of a polypeptide or
protein. Both naturally occurring or mutated sequences may
be employed, which are active in the formation of the
isopeptide bond. The naturally occurring sequence may be
from any convenient source, which is recognized by the
enzyme which catalyzes the formation of the isopeptide bond.
Depending on the use of the subject composition, where the
composition is to be used in the treatment of a mammalian
host, particularly a human host, the sequence will normally
be biocompatible, so as to avoid a strong immune response or
the need for immunosuppression.

The consensus sequence may be derived from fibrinogen,
fibronectin, myosin, the microtubule-associated protein
"tau", whether primate or other mammalian species,
particularly human, casein, or from any other protein, where
the protein has a consensus sequence which is a substrate
for an available enzyme for the isopeptide bond formation,
which consensus sequence and enzyme can be used in the
applications for the subject compositions. Human fibrin has
the sequence (residues 421-437) (GEGQQHHLGGAKQAGDV) (SEQ ID
NO: 04); bovine casein has the sequence (residues 162-175)
(VLSLSQSKVLPVPE) (SEQ ID NO: 05); other sequences may also
find use.
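
As a reading aid, the two sequences quoted above can be scanned for the residues involved in isopeptide formation (glutamine, Q, as carboxamide donor and lysine, K, as amine donor). The following is a minimal sketch; the function name and the use of 1-based positions within the quoted peptide (rather than protein residue numbers) are assumptions made for illustration, and the output does not indicate which residue is actually reactive.

```python
# Sequences quoted in the text (SEQ ID NO: 04 and SEQ ID NO: 05).
SEQUENCES = {
    "human fibrin, residues 421-437": "GEGQQHHLGGAKQAGDV",
    "bovine casein, residues 162-175": "VLSLSQSKVLPVPE",
}


def residue_positions(sequence, residue):
    """Return the 1-based positions of `residue` within `sequence`."""
    return [i + 1 for i, aa in enumerate(sequence) if aa == residue]


for name, seq in SEQUENCES.items():
    # Positions are counted within the quoted peptide, not within the protein.
    print(name, "Q:", residue_positions(seq, "Q"), "K:", residue_positions(seq, "K"))
```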

The mutated sequences may be truncated sequences,
particularly for removal of one of the amino acids involved
with the isopeptide formation. It is found that FXIIIa
enzyme is relatively specific as to the amino acids
contiguous to the glutamine, but not the lysine. Therefore,
as to the glutamine donor, the contiguous amino acids will
be, for the most part, the naturally occurring amino acids,
there usually being at least 5 amino acids of the consensus
sequence, usually at least 2 on each side of the glutamine.
As for the lysine, the contiguous amino acids may be
modified and the consensus sequence truncated, while still
retaining activity. Usually the sequences for the glutamine
will have at least 6, more usually 8 amino acids, and may be
as many as 60 amino acids, usually not more than about 45
amino acids, more usually not more than about 36 amino
acids. The number of amino acids in the enzyme recognition
site sequence depends upon the nature of the repetitive
sequence, the length of the repetitive sequence, the source
of the consensus sequence, whether the naturally occurring
sequence or the mutated sequence, the inclusion of
additional amino acid sequences providing for additional
functionalities, and the like. For lysine, there need be
none of the consensus amino acids, there usually being a
total of at least 3 contiguous amino acids of the consensus
sequence, including the lysine. Instead of lysine, amino
groups may be provided, particularly where the amino group
is bonded to a methylene group, more particularly to a
polymethylene of at least two methylene groups.
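
The length and flanking guidance for the glutamine site can be expressed as a simple check. The sketch below is illustrative only: the function name and default thresholds are assumptions drawn from the "at least 5 amino acids", "at least 2 on each side" and "as many as 60 amino acids" figures above, and passing the check says nothing about whether FXIIIa would actually recognize a given sequence.

```python
def glutamine_sites_ok(site, min_flank=2, min_len=5, max_len=60):
    """Return 1-based positions of Q residues that have at least `min_flank`
    residues on each side; return [] if the site length is out of range."""
    if not (min_len <= len(site) <= max_len):
        return []
    return [
        i + 1
        for i, aa in enumerate(site)
        if aa == "Q" and i >= min_flank and len(site) - 1 - i >= min_flank
    ]


# The fibrin sequence quoted earlier (SEQ ID NO: 04) passes for all three Q.
print(glutamine_sites_ok("GEGQQHHLGGAKQAGDV"))
# A hypothetical truncation leaving one residue before Q fails the check.
print(glutamine_sites_ok("SQSKV"))
```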

The small molecules which find use will have at least
two reactive sequences to serve as cross-linking agents for
the polymer. The small molecules may be aliphatic,
alicyclic, aromatic, heterocyclic or combinations thereof,
usually at least partially aliphatic. The small molecules
may be non-oligomeric or oligomeric, e.g. oligopeptides.
The molecules may be hydrophilic or hydrophobic. The small
molecules may provide the carboxamide group, the amino group
or both, usually the amino group.

The small molecules used in conjunction with the
polymeric molecule(s) provide for a number of opportunities
not available using solely polymeric molecules. In this
way, cross-linking may be controlled, where the polymer only
has one of the amino acids involved with the isopeptide
link, and the cross-linking agent becomes exhausted or may
be removed from the polymer. By using the small cross-linking
agent, the degree of cross-linking can be better
controlled and the polymer may have a large number of
recognition sites. Also, depending upon the nature of the
recognition site, e.g. Q or K, the small molecules may serve
as cross-linking agents to adjacent compositions, such as
tissue or other substrate.

Various polymeric compounds may be employed to produce
the cross-linked product. As already indicated, the
recognition site(s) may be part of the polymeric backbone or
a side chain to a polymeric backbone. Where the recognition
site is part of the polymeric backbone, the polymer will
usually be a protein, although an oligopeptide block may be
co-polymerized, usually with a polymer other than a protein
polymer using chemical linkage. However, where the
recognition site is a side chain, the polymer may be a
protein or other physiologically and enzymatically
compatible polymer.

The subject polymers may include polymers having
recognition sites for enzyme catalyzed reactions proximal
to the termini, with an intervening sequence, particularly
an intervening sequence of repetitive units as described
above. However, for the most part the subject polymers
comprising recognition sites for enzyme catalyzed reactions
resulting in new covalent bonds and cross-linking may be
generally depicted by the following formula:

α-{(β-Ω)p or (Ω-β)p or Σ}n-α1

wherein:

α and α1 may or may not be present and, if present, are the
same or different and are of not more than about a total of
125 amino acids, usually not more than a total of about 70
amino acids, and usually differing from the intervening
sequences of the polymer, but may include one or more
recognition sites, generally being not more than about 10
number % of the amino acids of the polymer, usually not more
than about 5 number % of the amino acids of the polymer;

β represents the intervening sequences or spacers of
the protein polymer, where the intervening sequence may be
free of any repetitive motif or may comprise repetitive
units, usually comprising at least two repetitive units and
usually not more than about 60 repetitive units, generally
comprising from about 3 to 30 repetitive units, where the
repetitive units may be the same, alternating different
repetitive units of 2 or more, usually 2, and blocks of
different repetitive units;

Σ intends {(Ω-β)p'}Ω;

p is an integer of from 2 to about 10, indicating that
there are that number of domains of the intervening sequence
and the reactive amino acid containing sequence, where each
of the domains may be the same or different, there usually
being not more than about 6 different domains, more usually
not more than about 3 different domains, frequently there
being from 1 to 2 different domains;

p' is an integer of from 1 to about 9, otherwise coming
within the definition of p;

Ω is a functional amino acid sequence, which may be
the same or different, usually the same, each time Ω is
repeated; at least 2 Ω include the reactive amino acid,
normally in an enzyme recognition site, as appropriate for
the particular enzyme, and may involve hydroxy, carboxy
(includes the acid, ester and amide), amino, phospho, or
other functionality for forming a covalent bond,
individually or in combination; for isopeptide formation,
the amino acid sequence may comprise one or both, carboxy
and primary amino, which may be a consensus sequence or a
mutated sequence, which may have one or both of the active
Q and K; for Q, generally being at least 5 amino acids and
not more than about 60 amino acids, usually not more than
about 30 amino acids, where one or more reactive sequences
may be present; for K, only lysine need be present,
preferably there being at least 3 contiguous amino acids of
the consensus sequence; other functional sequences include
GGAKQAGDV (SEQ ID NO: 06), and the like, the sequences
generally being at least 3 and not more than about 60 amino
acids, frequently not more than about 45 amino acids;
instead of or in addition to having a reactive amino acid,
Ω may have a functional amino acid or sequence involved with
other properties of interest, such as proteolytic cleavage
sites, cell binding sites, adhesion sites, etc.; and

n will vary with p and the number of amino acids in
Ω-β, where n is at least 1, and n x p is usually at least 2,
and not more than about 75, usually not more than about 60.

One group of block copolymers of the subject invention
will, for the most part, comprise, individually or in
combination, silk-like sequences, particularly GAGAGS (SEQ
ID NO: 07), and elastin-like sequences, particularly VPGVG
(SEQ ID NO: 08), where the repeating units will be in blocks
of at least 2 repeating units, more usually at least about
4 repeating units, and generally not more than about 32
repeating units, more usually not more than about 24
repeating units. Usually, the number of repeating units in
the block of the silk repeating unit will not be more than
about 2 times the number of repeating units in the block of
the elastin repeating unit, usually not more than about 1
time the number of repeating units, and will generally be at
least about 0.1 time the number of repeating units of
elastin in the elastin block. For the most part, the
elastin block will have at least 4 repeating units, more
usually at least about 8 repeating units, and up to about 32
repeating units, more usually up to about 24 repeating
units. By contrast, the silk block will have at least about
2 repeating units and not more than about 4 repeating units,
usually not more than about 16 repeating units, and
preferably not more than about 8 repeating units.
Preferred compositions will have from about 10 to 60
percent of silk repeating units, more usually from about 20
to 55 percent of silk repeating units, where the ratio of
elastin repeating units to silk repeating units per block
will generally be in the range of from about 4:1 to 1:1.
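
The composition ranges above reduce to simple arithmetic on the number of repeating units per block. The sketch below is illustrative: the function names are hypothetical, and it assumes that "percent of silk repeating units" is counted by number of repeating units rather than by number of amino acids, which the text does not state explicitly.

```python
def block_composition(silk_units, elastin_units):
    """Return (percent silk repeating units, elastin:silk ratio) for one
    silk block followed by one elastin block."""
    total = silk_units + elastin_units
    percent_silk = 100.0 * silk_units / total
    ratio = elastin_units / silk_units
    return percent_silk, ratio


def in_preferred_range(silk_units, elastin_units):
    """Check the stated 10-60 percent silk and 4:1 to 1:1 elastin:silk ranges."""
    percent_silk, ratio = block_composition(silk_units, elastin_units)
    return 10.0 <= percent_silk <= 60.0 and 1.0 <= ratio <= 4.0


# Two silk units with eight elastin units: 20 percent silk, ratio 4:1.
print(block_composition(2, 8), in_preferred_range(2, 8))
# Two silk units with sixteen elastin units: ratio 8:1, outside the range.
print(block_composition(2, 16), in_preferred_range(2, 16))
```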

One repetitive unit homopolymer employs the collagen
motif, where each repeating unit block has from 2 to about
10 different triads, usually from about 2 to about 6
different triads, in the repeating unit block, between
functional sequences. The number of triads will generally
be at least about 3 and not more than about 36, usually at
least about 5 and not more than about 25 triad repeating
units. The number of prolines will be below about 45 number
% of the amino acids in the repeating unit block, generally
having on the average not more than about 1.2, usually not
more than about 1, proline per triad.
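
The triad and proline limits can be checked by counting. The sketch below is a hypothetical helper, not part of the described methods; as a convenient input it uses the 15-residue unit GAPGAPGSQGAPGLQ that appears in the antigen of SEQ ID NO: 09 in the Experimental section, without implying that this unit is a preferred repeating block.

```python
def proline_stats(block):
    """Split a collagen-like block into triads and summarize proline content."""
    if len(block) == 0 or len(block) % 3 != 0:
        raise ValueError("block length must be a positive multiple of 3")
    triads = [block[i:i + 3] for i in range(0, len(block), 3)]
    prolines = block.count("P")
    return {
        "triads": len(triads),
        "distinct_triads": len(set(triads)),
        "percent_proline": 100.0 * prolines / len(block),
        "prolines_per_triad": prolines / len(triads),
    }


stats = proline_stats("GAPGAPGSQGAPGLQ")
print(stats)
# Compare against the stated limits (below about 45 number % proline,
# not more than about 1.2 proline per triad on average).
print(stats["percent_proline"] < 45.0 and stats["prolines_per_triad"] <= 1.2)
```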

By varying the length of each block, in the case of
block copolymers, the ratio of amino acids of one block in
relation to the amino acids of the other block, the choice
of the repetitive units, the number of functional sequences
and their location, e.g. terminal or internal, whether
there can be internal cross-linking or only intermolecular
cross-linking, the properties of the products may be greatly
varied. For example, resorption rates can be greatly
varied, where in an elastin/fibroin block copolymer,
resorption will be enhanced with higher proportions of the
elastin repeat unit. The ability to promote hemostasis or
cell attachment and migration can be varied. Also, various
physical properties, such as solubility, adsorption to
tissues, tensile strength, cohesive strength, elongation,
and set times, can be substantially retained or varied, so
as to provide the necessary physical properties for the
intended application.

In the case of the presence of small repetitive units,
particularly of from about 3 to 18 amino acids, the
proportion of the total amino acids contributed by any one
repeating block or domain to the total number of amino acids
may vary widely, from a range of about 5 number % to about
95 number %, usually ranging from about 15 number % to about
80 number %, more usually not less than about 20 number %
and up to about 75 number %.

As already indicated, instead of using polymers where
the enzyme recognition site sequence, which includes by
definition the reactive amino acid by itself or in
conjunction with other amino acids, is in the chain of the
polymer, the enzyme recognition site sequence may be a side
chain. A wide variety of polymers may be used as the
backbone for the enzyme recognition site or reactive side
chains. The choice of polymer will vary in accordance with
the intended application and may be naturally occurring,
synthetic or combinations thereof. Such polymers include,
but are not limited to, synthetic polymers, both addition
and condensation polymers, such as polylactides,
polyglycolides, polyanhydrides, polyorthoesters, polyvinyl
compounds, polyolefins, polyacrylates, polyethylene glycol,
polyesters, polyvinyl alcohol, polyethers, copolymers
thereof, and naturally occurring polymers, such as collagen,
atelopeptide collagen, fibrinogen, keratin, casein,
chitosan, heparin, dextran, cellulose, glycosaminoglycans,
hyaluronic acid, and the like.

The reactive side chains which are attached may be
varied widely, depending on the nature of the functionality
required for enzyme catalyzed reaction and the requirements
for recognition by the enzyme for reaction. In the case of
isopeptide formation, for example, it will depend on whether
the side chain comprises a reactive glutamine and lysine, or
one or the other. So far as the lysine is concerned, when
synthesizing the side chain, the lysine may be substituted
with a polymethylene primary amine, usually having at least
three methylene groups, where the reactive portion is bonded
to a group which can be linked to the polymer backbone. The
glutamine comprising reactive sequence will usually comprise
at least about 8 amino acids to be efficiently recognized by
FXIIIa, while the lysine reactive group may contain no amino
acids or may contain 1 or more amino acids, conveniently
comprising at least 6 amino acids, where the natural
sequence is employed.

Of particular interest are polymers having short repeat
units comprising a reactive linking amino acid, particularly
carboxy, amino, and thiol functionality, such as D, E, K, R,
and C. Particularly, the unit will be of from 3 to 10,
usually 3 to 6 amino acids, where 1 or more amino acids will
be glycine or alanine, usually fewer than 100% of the amino
acids other than the reactive amino acid, generally being
from about 29 to 75% of the total number of amino acids.
Conveniently, one may replace one of the amino acids of a
repeating unit with the reactive amino acid, so that the
structure of the polymer is not significantly modified.

The number of side chains will be at least about 2,
usually at least 4, and generally not more than about 30,
usually not more than about 20. As the polymer
becomes cross-linked, the accessibility of the side chains
to the enzyme diminishes, so that increasing the
number of side chains beyond a certain minimum will not
provide any advantages as to setup time and strength.

Depending on the functionality of the reactive linking
amino acid, various cross-linking compounds may be used. In
the cross-linking compound, active olefins may be used with
either amino or carboxy groups, while thiol groups may be
used with amino or carboxy groups. Amino groups on the
polymer may be functionalized with maleic anhydride to
provide for an active olefin on the polymer. Usually, the
two functionalities on the linking compound will be
different and the spacer between the two functionalities
will usually be aliphatic or carbocyclic aromatic. The
spacer will usually be fewer than 36 carbon atoms, usually
fewer than 20 carbon atoms, generally being other than a
bond, usually being at least one carbon atom.
Alternatively, the pendent sequence may be directly bonded
to a functionality on the backbone polymer, where the
functionality on the pendent group and the backbone polymer
are compatible for bonding.
Generally, the side chain will be at least about 3
amino acids long, usually at least about 5 amino acids long
and generally from about 5 to 16 amino acids long, usually
not more than about 12 amino acids long. As indicated
previously, with lysine and transglutaminases, an amino
functionality can suffice. Usually, there will be at least
one glycine or alanine, and up to 50% or more of the amino
acids other than the active amino acid may be glycine,
depending on the consensus sequence required for activity.
For transglutaminases, the consensus sequence comprising the
glutamine may include LGPGQSKVIG (SEQ ID NO: 49) or
GEGQQHHLGG (SEQ ID NO: 50). Of particular interest are
backbones having fibroin and elastin repetitive units coming
within the description provided previously, where one of the
fibroin or elastin units has one of the amino acids
substituted with the reactive amino acid. Particularly, by
having a single available cysteine in the pendent group
(there may be more than one cysteine, so long as the other
cysteines are unreactive, e.g. protected with a removable
protective group), the pendent groups may be readily
attached to the backbone polymer by means of thioether
formation with an activated olefin.

The subject compositions find particular use in the
formation of articles of manufacture, by themselves or in
combination with other materials. In one application,
articles may be produced for use internally to a mammalian
host, where there is an interest in biocompatibility,
resorption rate, ability to vascularize, tissue adhesive
and/or bonding capability, and the like. Various articles
can be prepared, such as gels, films, threads, coatings,
formed objects, such as pins and screws, or injectable
compositions which are flowable, where the injectable
composition may set up and bond or seal tissues, form a
depot for a drug, or be a filler, coating, or the like. The
injectable composition may be administered with a syringe,
catheter, trocar, or the like. The formed objects may be
prepared in accordance with conventional ways, such as
molding, extrusion, precipitation from a solvent, solvent
evaporation, and the like. The flowable depot can be
obtained by using a molecular dispersion, fine particles in
a medium saturated with the polymer, using a melt, where the
melting temperature may be achieved by adding
physiologically acceptable additives, and the like.

The articles may find use in a variety of situations
associated with the implantation of the article into a
mammalian host or the application of the article to the
surface of a mammalian host, e.g. wound healing, burn
dressing, etc. Those situations, where the performance of
the article may be retained for a predetermined time and
replaced by natural materials through natural processes,
desirably employ materials which will be resorbed after
having fulfilled their function in maintaining their role
until the natural process has reestablished a natural
structure. Thus, the compositions may find use in holding
tissue together, covering tissue, encapsulating cells or
organs, providing a coating that cells can invade and
replace the composition with natural composition, e.g. bone,
soft tissues, and the like.

To enhance the rate of setup of the polymeric
composition, the composition may be prepolymerized. When
prepolymerized, usually at least about 10% of the total
number of cross-links which are present upon completion of
the cross-linking reaction may be formed and usually not more
than 75%, more usually not more than about 50%. Depending
on the utility of the product, the number of cross-links
introduced with prepolymerization should allow the
prepolymerized composition to remain workable and provide
sufficient time prior to setup where it is no longer
workable for application.

Alternatively, one may wish to provide for a relatively
constant supply of a particular agent, particularly a drug,
whereby a depot may be introduced into the mammalian host
which will degrade over a predetermined time. The depot may
be prepared from the resorbable composition and drug, so
that as the external surface of the depot is eroded, drug
will be released. By controlling the form of the depot,
whereby a relatively constant volume of the resorbable
material is degraded over an extended period of time, the
drug level may be maintained during that period.

The period required for resorption can be as short as
0.5 days and may exceed 4 weeks, 6 weeks, 8 weeks or more
depending upon the particular choice of composition. Thus,
the period of maintenance of the composition may be greatly
varied.

The subject compositions may be used to provide
compositions where various functionalities may be affixed to
a backbone polymer at will. For example, where the polymer
has been affixed in place, by adding less than saturating
amounts of a low molecular weight isopeptide bond forming
reactant from time to time, one can affix functionalities or
activities, e.g. radioactivity, fluorescence, light
absorption, magnetic particles, etc. to the site. By adding
a compound comprising a reactive sequence and functionality
with FXIIIa to the polymer, whereby isopeptide bond
formation will occur, the functionality or activity will
become covalently bonded to the polymer.

The subject compositions may also be used in assays,
where one can determine the amount of analyte by having a
competition between a conjugate of the analyte with the
enzyme FXIIIa or a small molecule having a reactive sequence
which forms the isopeptide link, e.g. lysine, where the
polymer has glutamine. Competitive assay protocols are well
known in the art and do not require exemplification here.

The compositions may be prepared in accordance with
conventional ways. For the polymers which have the
consensus sequence in the polymer chain, a method which may
be employed is described in U.S. Patent No. 5,243,038.
Briefly, sequences may be synthesized comprising a plurality
of repeating units, where complementary sequences result in
dsDNA having overhangs. A series of dsDNA molecules may be
prepared and stepwise introduced into a cloning vector as
the gene for the protein is constructed. A unit can be
obtained in this way, which may be sequenced to ensure that
there have been no changes in the sequence, followed by
multimerization of the unit, cloning and expression. For
further details, see the above-indicated patent.

For the compositions where the reactive sequences are
side chains, one can provide for a wide variety of active
functionalities when synthesizing the side chain, which will
allow for covalent bonding to the backbone polymer. The
backbone polymers may be modified, as appropriate, for
covalent linkage. The synthetic polymers may be modified by
oxidation to provide hydroxyl groups, sulfonation to provide
sulfonyl groups, and the like. During the polymerization,
monomers may be introduced which provide for reactive sites.
With the lactides and glycolides, 4-hydroxybut-2-enoic acid
may be employed as a comonomer, and the like. For example,
a mercaptan group may be provided, where the polymer has an
active olefinic group. Alternatively, carboxyl, hydroxyl or
amino groups may be present, which allow for attachment.
Because of the variety of polymers and groups which may be
present on the side chain, no general methodology can be
described. With the naturally occurring polymers, there
will normally be present a large number of active
functionalities for linking. Methods of linking compounds
to such naturally occurring polymers are well known in the
art.

The following examples are offered by way of
illustration and not by way of limitation.

EXPERIMENTAL 

EXAMPLE 1 

Methods 

The construction of synthetic DNA and its use in large
polypeptide synthesis is described in U.S. Patent No.
5,243,038; PCT/US89/05016 and PCT/US92/09485, the
disclosures of which are herein incorporated by reference.
Modifications to these methods and additional methods used
are described below.

1. Use of filters and columns for DNA Purification 
A. Ultrafree®-Probind filter unit ("Probind",
Millipore): the DNA containing solution was applied to the
filter unit and spun at 12,000 RPM for 30 seconds in a
Sorvall Microspin 24S.

B. Microcon-30 filter (Amicon): the DNA
containing solution was washed by applying to the filter and
exchanging twice with H2O by spinning at 12,000 RPM for 6
min in a microfuge.

C. Bio-Spin 6 column ("Bio-Spin", BioRad): Salts
and glycerol were removed from the DNA solution by applying
to the column, previously equilibrated in TEAB (triethyl
ammonium bicarbonate pH 7.0), and spinning in a Sorvall RC5B
centrifuge using an HB4 rotor at 2,500 RPM for 4 min.

2. Phosphatase treatment of DNA 

Phosphatase treatment of DNA was also performed by
resuspending ethanol precipitated DNA from the restriction
enzyme digest in 20 mM Tris-HCl pH 8.0, 10 mM MgCl2 to a
final DNA concentration of 20 µg/ml. Shrimp Alkaline
Phosphatase (SAP) was added at 2 U/µg of DNA and the mixture
was incubated at 37°C for one hour, heat inactivated for 20
minutes at 65°C and then passed through a Probind filter and
subsequently a Bio-Spin column.
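
The quantities in the phosphatase step scale directly with the amount of DNA. The following is a minimal sketch of that arithmetic; the function name and the 5 µg example are hypothetical, while the 20 µg/ml and 2 U/µg figures are taken from the text.

```python
DNA_CONC_UG_PER_ML = 20.0  # final DNA concentration stated in the text
SAP_UNITS_PER_UG = 2.0     # Shrimp Alkaline Phosphatase dose stated in the text


def sap_reaction(dna_ug):
    """Return (resuspension volume in microliters, SAP units) for `dna_ug` of DNA."""
    if dna_ug <= 0:
        raise ValueError("DNA amount must be positive")
    volume_ul = dna_ug / DNA_CONC_UG_PER_ML * 1000.0
    sap_units = dna_ug * SAP_UNITS_PER_UG
    return volume_ul, sap_units


# Hypothetical example: 5 micrograms of digested DNA.
volume_ul, sap_units = sap_reaction(5.0)
print(f"resuspend in {volume_ul:.0f} ul, add {sap_units:.0f} U SAP")
```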

3. Preparative agarose gel electrophoresis 

For agarose ligation, the buffer used was 1x TAE (50 mM
Tris-acetate, pH 7.8).

4. Agarose DNA Ligation 

The agarose was melted at 65°C, the temperature was
then lowered to 37°C and ligation buffer (5x = 100 mM
Tris-HCl, pH 7.5, 50 mM MgCl2, 50 mM DTT, 1 mM ATP) was added;
the tube was then placed at room temperature and ligase was
added (1000 units T4 DNA ligase (NEB)). The reaction volume
was usually 50 µl. The reaction was incubated at 15°C for
16-18 hrs.
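
For orientation, the 5x ligation buffer composition can be converted to working concentrations. The sketch below assumes, as the text does not state explicitly, that the buffer is used at 1x in the final 50 µl reaction; the function and variable names are hypothetical.

```python
# Composition of the 5x ligation buffer as given in the text (mM).
STOCK_5X_MM = {"Tris-HCl": 100.0, "MgCl2": 50.0, "DTT": 50.0, "ATP": 1.0}


def ligation_mix(reaction_volume_ul=50.0, stock_fold=5.0):
    """Return (volume of stock buffer in microliters, final concentrations in mM),
    assuming the buffer is diluted to 1x in the final reaction volume."""
    buffer_volume_ul = reaction_volume_ul / stock_fold
    final_mm = {name: conc / stock_fold for name, conc in STOCK_5X_MM.items()}
    return buffer_volume_ul, final_mm


buffer_ul, final = ligation_mix()
print(buffer_ul, final)
```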

5. Agarose DNA purification using an Ultrafree®-MC
Filter Unit

This procedure can be used for agarose slices up to 400
µl in size. After agarose gel electrophoresis, the DNA is
visualized by ethidium bromide staining and the agarose
block containing the DNA band of interest is excised. The
agarose is then frozen at -20°C for 1 hour, then quickly
thawed at 37°C for 5 minutes. The agarose is then
thoroughly macerated. The pieces are then transferred into
the sample cup of the filter unit and spun at 5,000 x g in a
standard microfuge for 20 minutes. The agarose is then
resuspended in 200 µl of Tris-EDTA, or other buffer, and
incubated at room temperature for 30 minutes to allow for
elution of additional DNA from the gel. The mixture is then
centrifuged for an additional 20 minutes at 10,000 RPM. The
DNA is, at this point, in the filtrate tube separated from
the agarose fragments and ready for subsequent DNA
manipulations.

6. Preparation of antibody to artificially 
synthesized peptides 

Following the same procedure as described in U.S.
Patent No. 5,243,038, an additional antigen was synthesized
having the sequence (GAPGAPGSQGAPGLQ)2YMK (SEQ ID NO: 09)
which was then coupled to keyhole limpet hemocyanin for use
as an immunogen. Polyclonal antisera ("CLP antibody") were
then prepared which bound to the CLP 3.7 and PPAS polymers
described below.

7. Immunoblotting of proteins in gels 

An alternative to the 125I-Protein A detection method
was used. This method relied on a chemiluminescent signal
activated by horseradish peroxidase (HRP). The
chemiluminescent reagents are readily available from several
suppliers such as Amersham and DuPont NEN. The western blot
was prepared and blocked with BLOTTO. A number of methods
were used to introduce the HRP reporter enzyme including,
for example, a hapten/anti-hapten-HRP, a biotinylated
antibody/streptavidin-HRP, a secondary reporter such as a
goat or mouse anti-rabbit IgG-biotinylated/streptavidin-HRP,
or a goat or mouse anti-rabbit IgG-HRP. These reagents were
bought from different sources such as BioRad or Amersham and
occasionally biotinylated antibodies were prepared in our
laboratory using Biotin NHS from Vector Laboratories,
Burlingame, CA (Cat. #SP-1200) following the procedure
accompanying the product. The following is an example of a
procedure used to detect the expression of protein polymers.
The blot was placed in 15 ml of BLOTTO solution
containing biotinylated goat anti-rabbit IgG (BioRad)
diluted in BLOTTO (1:7500) and gently agitated for 2 hrs at
room temperature. The filter was then washed for 30 minutes
with 3 changes of TSA (50 mM Tris-HCl pH 7.4, 0.9% NaCl,
0.2% sodium azide). The blot was then incubated for 20
minutes at room temperature with gentle rotation, in 20 ml
of HRP-Streptavidin (Amersham) diluted 1:1000 in TBS
(100 mM Tris Base, 150 mM NaCl, pH 7.5) with 0.1%
Tween 20. The blot was then washed three times for 5 minutes
each in TBS with 0.3% Tween 20 and then three times for 5
minutes each in TBS with 0.1% Tween 20. The blot was then
incubated for 1 minute with gentle agitation in 12 ml of
development solutions #1 and #2 (Amersham) equally mixed. The
blot was removed from the development solution and
autoradiographed.

8. Protein expression analysis 

An overnight culture which had been grown at 30°C was
used to inoculate 50 ml of LB media contained in a 250 ml
flask. Kanamycin was added at a final concentration of 50 µg
per ml and the culture was incubated with agitation (200
rpm) at 30°C. When the culture reached an OD600 of 0.8, 40
ml were transferred to a new flask prewarmed at 42°C and
incubated at the same temperature for approximately 2 hours.
The cultures (30°C and 42°C) were chilled on ice and the
OD600 was taken. Cells were collected by centrifugation and
then divided in 1.0 OD600 aliquots and used to perform western
analysis using the appropriate antibodies.
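
Dividing cells into 1.0 OD600 aliquots amounts to normalizing culture volume by the measured optical density. The sketch below is a hypothetical helper, assuming one OD600 unit is defined as the cells contained in 1 ml of culture at OD600 = 1.0; the example readings are illustrative, not measured values from the text.

```python
def volume_for_od_units(od600, od_units=1.0):
    """Return the culture volume (ml) that contains `od_units` OD600 units."""
    if od600 <= 0:
        raise ValueError("OD600 must be positive")
    return od_units / od600


# Illustrative readings only.
for reading in (0.8, 1.6):
    print(f"OD600 {reading}: take {volume_for_od_units(reading):.3f} ml per aliquot")
```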

9. Amino acid analysis 

Amino acid derivatives were analyzed by reverse phase
HPLC using a Waters 600E system.

10. Peptide Synthesis 

Synthetic peptides were also prepared on a
Rainin/Protein Technologies PS3 FMOC peptide synthesizer.
Both the synthesis and cleavage were accomplished using the
methods supplied by the manufacturer in the instrument
manual.

11. In vitro DNA synthesis

The β-cyanoethyl phosphoramidites, controlled-pore
glass columns and all synthesis reagents were obtained from
Applied Biosystems, Foster City, California. Synthetic
oligonucleotides were prepared by the phosphite triester
method with an Applied Biosystems Model 381A DNA synthesizer
using a 10-fold excess of protected phosphoramidites and 0.2
µmole of nucleotide bound to the synthesis support column.
The chemistries used for synthesis are the standard
protocols recommended for use with the synthesizer and have
been described (Matteucci et al., J. Amer. Chem. Soc.,
103:3185-3319 (1981)). Deprotection and cleavage of the
oligomers from the solid support were performed according to
standard procedures as provided by Applied Biosystems. The
repetitive yield of the synthesis as measured by the optical
density of the removed protecting group as recommended by
Applied Biosystems was greater than 97.5%.
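
The significance of the repetitive yield becomes clearer when it is compounded over the length of an oligonucleotide. The sketch below uses the common simplification that an n-mer requires n - 1 coupling steps, each with the same yield; this model and the function name are assumptions for illustration and are not stated in the text.

```python
def full_length_fraction(length_nt, stepwise_yield=0.975):
    """Estimate the fraction of full-length product for an oligonucleotide of
    `length_nt` bases, assuming (length_nt - 1) couplings at a constant yield."""
    if length_nt < 1:
        raise ValueError("length must be at least 1")
    if not (0.0 < stepwise_yield <= 1.0):
        raise ValueError("stepwise yield must be in (0, 1]")
    return stepwise_yield ** (length_nt - 1)


for n in (20, 50, 100):
    print(f"{n}-mer: {100.0 * full_length_fraction(n):.1f}% full length")
```

Under this assumption, a 100-mer made at a 97.5% stepwise yield gives on the order of 8% full-length product, which is consistent with purifying the crude mixture by preparative gel electrophoresis.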

The crude oligonucleotide mixture was purified by
preparative gel electrophoresis as described by the Applied
Biosystems protocols in Evaluating and Isolating Synthetic
Oligonucleotides, 1992 (Formerly: User Bulletin 13, 1987).
The acrylamide gel concentration varied from 10 to 20%
depending upon the length of the oligomer. If necessary,
the purified oligomer was identified by UV shadowing,
excised from the gel and extracted by the crush and soak
procedure (Smith, Methods in Enzymology, 65:371-379 (1980)).

For DNA synthesis of oligonucleotides longer than 100
bases, the synthesis cycle was changed from the protocol
recommended by Applied Biosystems for the 381A DNA
synthesizer. All the reagents used were fresh. All the
reagents were supplied by Applied Biosystems except for the
acetonitrile (Burdick and Jackson Cat#017-4 with water
content less than 0.001%) and the 2000 Å pore size column
(Glen Research). Due to the length of the oligo, interrupt
pauses had to be inserted during the synthesis to allow
changing the reagent bottles that emptied during synthesis.
This interrupt pause was done at the cycle entry step and
the pause was kept as short as possible. The washes after
detritylation by TCA, through the beginning of each
synthesis cycle, were increased from about 2x to 3x over the
recommended time. The time allocated for the capping was
also increased to limit truncated failure sequences. After
the synthesis the deprotection was done at 55°C for 6 hours.
After desalting the synthesized DNA was amplified using PCR.

12. Sequencing of DNA 

Storage and analysis of data utilized software from DNA
Strider, DNA Inspector IIe or DNAid for the Apple Macintosh
personal computer.

13. Dideoxy DNA sequencing of double stranded plasmid DNA

As described in U.S. Patent No. 5,243,038, plasmid DNA
was prepared on a small scale. Primers were synthesized
using a DNA synthesizer and were annealed to the plasmid DNA
following the procedure described for M13 sequencing. The
sequencing reactions were done using Sequenase (United
States Biochemicals) and the conditions were as recommended
by the supplier. All sequences were run on polyacrylamide
gels.

14. PCR Amplification

The PCR reaction was performed in a 100 μl volume in a
Perkin Elmer thin-walled GeneAmp™ reaction tube.
Approximately 1 μM of each primer DNA was added to 1x PCR
buffer (supplied by Perkin Elmer as 10x solution), 200 μM of
each dNTP, 5U AmpliTaq, and several concentrations of the
target DNA. Amplification was performed in a Perkin Elmer
DNA Thermal cycler model 480 for 30 cycles with the
following step cycles of 12 min each: 95°C, 62°C, and 72°C.
Aliquots from the different reactions were analyzed by
agarose gel electrophoresis using 1.5% low melting point
agarose in 0.5x TAE buffer. The reaction mixtures that gave
the desired band were pooled and spun through a Probind
filter to remove the AmpliTaq enzyme, then a Microcon-30
filter and a Bio-Spin column. The DNA was then concentrated
in vacuo.
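
Setting up such a reaction is a dilution calculation (C1 x V1 = C2 x V2). The sketch below is a hedged illustration: the 100 μl volume, the 10x buffer and the 1 μM primer target come from the text, the dNTP target is taken as 200 μM (an assumed reading of the unit), and the primer and dNTP stock concentrations are hypothetical values chosen only to make the example concrete.

```python
def volume_needed(stock_conc: float, final_conc: float, final_volume_ul: float) -> float:
    """Volume of stock (in microlitres) giving final_conc in final_volume_ul.

    stock_conc and final_conc must be in the same unit (C1 * V1 = C2 * V2).
    """
    if stock_conc <= 0:
        raise ValueError("stock concentration must be positive")
    return final_conc * final_volume_ul / stock_conc


if __name__ == "__main__":
    final_volume = 100.0  # microlitres, as in the text

    mix = {
        # name: (stock concentration, final concentration), same unit in each pair
        "10x PCR buffer": (10.0, 1.0),       # "x" units, from the text
        "primer 1": (20.0, 1.0),             # micromolar; 20 uM stock is hypothetical
        "primer 2": (20.0, 1.0),             # micromolar; 20 uM stock is hypothetical
        "dNTP mix (each)": (10000.0, 200.0), # micromolar; 10 mM stock is hypothetical
    }

    for name, (stock, final) in mix.items():
        print(f"{name:16s} {volume_needed(stock, final, final_volume):6.2f} ul")
```

The remaining volume is made up by the enzyme, the target DNA and water.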

15. Fermentation conditions

The fermentors used for the expression of protein
polymers were usually a 15 l MBR, 10 l working volume, or a
13 l Braun Biostat E, 8.5 l working volume. The choice of
the fermentor and its size is not critical. Any media used
for the growth of E. coli can be used. The nitrogen source
ranged from NZAmine to inorganic salts and the carbon source
generally used was glycerol or glucose. All fermentations
were done with the appropriate selection conditions imposed
by the plasmid requirements (e.g. kanamycin, ampicillin,
etc.). The fermentation method used to express protein
polymers in E. coli was the fed-batch method. This is the
preferred method for the fermentation of recombinant
organisms, although other methods can be used.

The fed-batch method exploits the stage of cell growth
where the organisms make a transition from exponential to
stationary phase. This transition is often the result of
either depletion of an essential nutrient or accumulation of
a metabolic byproduct. When the transition is the result of
nutrient depletion, the addition of nutrients to the system
causes cell division to continue. One or more essential
nutrients can incrementally be added to the fermentation
vessel during the run, with the net volume increasing during
the fermentation process. The result is a controlled growth
rate where biomass and expression levels can be optimized.
When the cell number in the culture has reached or is
approaching a maximum, protein polymer production is induced
by providing an appropriate physical or chemical signal,
depending upon the expression system used. Production will
then continue until the accumulated product reaches maximum
levels (Fieschko, J., and Ritch, T., Chem. Eng. Commun.
1986, 45: 229-240; Seo, J.H.; Bailey, J.E., Biotechnol.
Bioeng. 1986, 28: 1590-1594).

Factor XIIIa reactive peptides

The sequence of fibrin and of the native cross-linking
site is known. The bovine milk protein β-casein is a known
substrate for factor XIIIa. Peptide blocks which include a
factor XIIIa cross-linking site and retain activity towards
factor XIIIa were produced. These peptide blocks or similar
amino acid sequences were then conjugated to high molecular
weight carrier polymers or were used in the construction of
protein polymers. When a formulation (aqueous and
physiological) containing such polymers is mixed with factor
XIIIa, it will undergo cross-linking leading to a setting
reaction in which the polymer solution will be converted to
a stiff gel or clot. The degree and spacing of cross-linking
will influence the setting time, the mechanical properties
and the cohesive strength of the gel. When applied on or in a
tissue, an adhesive bond will be created both through
physical adsorption to the tissue matrix and through
covalent bonding to available tissue proteins.

A synthetic peptide was synthesized containing the
amino acid sequence VLSLSQSKVLPVPE (SEQ ID NO: 10) (peptide
93.1) corresponding to residues 162-175 of bovine β-casein
as published by Dumas, B.R., Brignon, G., Grosclaude, F.,
Mercier, J.C. (1972) Eur. J. Biochem. 25, 505-514. The
peptide was shown to serve as a substrate for factor XIIIa
cross-linking using HPLC analysis. A solution containing
the peptide was incubated with thrombin activated factor
XIII in the presence of excess monodansylcadaverine (MDC).
MDC is a fluorescent amino group containing compound which
serves as a lysine analog for factor XIIIa. The reaction
products were separated by reverse phase high performance
liquid chromatography. The unreacted peptide peak migrated
with a retention time of 21 minutes. A new peak with a
retention time of 23.5 minutes was observed in the reaction
mixture and its area was proportional to the time of
reaction. The new peak was isolated and its atomic mass was
determined by mass spectrometry to be 1815. The combined
mass of peptide 93.1 (1496) plus MDC (335) is 1831. If the
transglutaminase activity of factor XIIIa forms an
isopeptide bond between the peptide glutamine side chain and
an available primary amino group, the reaction should
liberate an ammonium ion, NH3+ (NH2 from the glutamine amide
and H from water) according to the reaction below:

Factor XIIIa Transglutaminase Reaction

Reactants:

Peptide 93.1-glutamine-CH2CH2CO-NH2 + NH2-CH2CH2CH2CH2CH2NH-Dansyl + H2O
mw = 1496                             mw = 335

Products:

Peptide 93.1-glutamine-CH2CH2CO-NH2-CH2CH2CH2CH2CH2NH-Dansyl + NH3+ OH-
mw = 1815

The difference between the sum of the molecular weights
of the substrates and the theoretical product is 16 atomic
units, the loss of NH2. The mass of the new peak matches
exactly the mass of the theoretical reaction product of
Peptide 93.1 and MDC by factor XIIIa. No other combinations
of reaction products match the mass of the new peak.
Therefore, it is concluded from this data that factor XIIIa
will create a covalent bond between Peptide 93.1 and a
compound containing an active amino group. The conversion
of Peptide 93.1 to MDC-Peptide 93.1 occurred with a Km of 

20 approximately 1.8 x 10*^ M. 
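
The mass bookkeeping in the preceding paragraphs can be written out as a short check. This is only a restatement of the arithmetic in the text, using the masses quoted there and nominal atomic masses (N = 14, H = 1); the function and constant names are hypothetical.

```python
# Masses quoted in the text (atomic mass units).
PEPTIDE_93_1 = 1496
MDC = 335
OBSERVED_PRODUCT = 1815

# Nominal atomic masses used for the leaving group.
NITROGEN = 14
HYDROGEN = 1
NH2_NOMINAL = NITROGEN + 2 * HYDROGEN


def mass_lost(substrate_masses, product_mass):
    """Difference between the summed substrate masses and the product mass."""
    return sum(substrate_masses) - product_mass


loss = mass_lost([PEPTIDE_93_1, MDC], OBSERVED_PRODUCT)

if __name__ == "__main__":
    print(f"combined substrates: {PEPTIDE_93_1 + MDC}")
    print(f"mass lost on coupling: {loss}")
    print(f"matches NH2 (nominal {NH2_NOMINAL}): {loss == NH2_NOMINAL}")
```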

According to the sequence of human fibrin gamma chain
(Rixon, M.W., Chung, D.W. and Davie, W.W., Biochemistry 22,
2077-2086, 1985), the carboxyl terminal 17 amino acids
(residues 421 to 437, GEGQQHHLGGAKQAGDV (SEQ ID NO: 11))
contain the residues glutamine (Q424) and lysine (K432)
which participate in the isopeptide bond formed by the
transglutaminase activity of factor XIIIa. Contained also
within this sequence is a platelet binding activity. This
peptide (Peptide 93.3) was synthesized and similarly shown
to serve as a substrate for factor XIIIa.

Similar results were obtained with Peptide 93.2
(GEGQQHHLGGARQAGDV) (SEQ ID NO: 12). This sequence
corresponds to amino acids 421-437 (SEQ ID NO: 11) of human
fibrin gamma-A protein except that K432 of the natural
sequence has been substituted with arginine (R). This
substitution conserves the overall charge of the peptide
block while eliminating the primary amino group of lysine
which may participate in transglutaminase activity. It
retains the reactive glutamine Q424 and the flanking
recognition sequences for cross-linking. The K to R
substitution prevents factor XIIIa cross-linking the peptide
with itself. Using reverse phase high performance liquid
chromatography, Peptide 93.2 (SEQ ID NO: 12) eluted with a
retention time of 15 minutes. A new peak with a retention
time of 19 minutes appeared when Peptide 93.2 (SEQ ID NO: 12)
was reacted with thrombin activated factor XIII and MDC.
The amount of the new species increased with increased
reaction time. Factor XIIIa caused the conversion of
Peptide 93.2 to this product with a Km of approximately 5.8
x 10⁻⁴ M.

An additional amino acid sequence (Peptide 93.4) was
designed that lacks the reactive glutamine Q424. Since this
sequence block only includes fibrin gamma chain residues
429-437 (GGAKQAGDV) (SEQ ID NO: 13), it can only serve as a
lysine donor to factor XIIIa mediated cross-linking.

Polymeric substrates comprising either Peptide 93.2 (SEQ
ID NO: 12) or 93.4 (SEQ ID NO: 13) alone cannot undergo
extensive cross-linking by transamination. Mixtures of such
polymers, where each contains only one-half of the substrate
required for cross-linking, can be used to promote
interstrand cross-linking, thereby improving cohesive bond
strength and mechanical properties. By mixing them in
disproportionate ratios, they may also be used to produce
adhesive formulations with excess Q424 activity, for
instance, to promote the probability of adhesive/tissue
bonding. Although factor XIIIa is fairly specific, using
glutamine residues which have conserved flanking amino acid
sequences, it is fairly promiscuous in its use of lysine
residues. Lysines in tissue proteins such as collagen and
fibronectin may participate in adhesive cross-linking, adding
to the strength of the adhesive bond.
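
The division of roles among the three fibrin-derived peptides follows directly from which of the two reactive positions each one retains. The sketch below simply looks up residues 424 and 432 in each sequence, using the residue numbering given in the text; the dictionary layout and function names are hypothetical, and the check says nothing about actual enzymatic activity.

```python
# Peptide sequences and the fibrin gamma chain residue number of their first
# amino acid, as given in the text.
PEPTIDES = {
    "93.3": ("GEGQQHHLGGAKQAGDV", 421),
    "93.2": ("GEGQQHHLGGARQAGDV", 421),
    "93.4": ("GGAKQAGDV", 429),
}


def residue_at(sequence, first_residue, position):
    """Return the amino acid at a gamma chain position, or None if absent."""
    index = position - first_residue
    if 0 <= index < len(sequence):
        return sequence[index]
    return None


def crosslink_roles(sequence, first_residue):
    """Report whether the reactive glutamine (Q424) and lysine (K432) are present."""
    return {
        "glutamine_donor": residue_at(sequence, first_residue, 424) == "Q",
        "lysine_donor": residue_at(sequence, first_residue, 432) == "K",
    }


if __name__ == "__main__":
    for name, (sequence, first) in PEPTIDES.items():
        print(name, crosslink_roles(sequence, first))
```

Only Peptide 93.3 carries both positions; 93.2 keeps only the glutamine and 93.4 only the lysine, which is why polymers built from 93.2 or 93.4 alone need a partner to cross-link extensively.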

EXAMPLE 3 

Construction of Plasmids Used to Create Protein Polymer
Adhesive Substrates

Construction of plasmid pPT0285

Plasmid pACYC184 (Chang, A. Y. C. and Cohen, S. N., J.
Bacteriol. 134:1141-1156 (1978)) was digested with BanI REN,
purified by agarose gel electrophoresis, and the DNA
fragment corresponding to approximately 2,000 bp was further
purified using a NACS column. This DNA fragment was filled
in using DNA polymerase and then self-ligated. The products
of the ligation mixture were transformed into E. coli strain
HB101 and selected on bacterial plates containing
chloramphenicol at 30 μg/ml. Plasmid DNA from individual
colonies was linearized by digestion with Eco47III. One
clone, pPT0235, was used as the acceptor vector for
subsequent DNA manipulations.

Two oligonucleotide strands (SEQ ID NOS: 14-15) were
synthesized and purified:

       (Eco47III) PmeI  PmlI  NruI  BanI  StuI  EcoRV  SnaBI (SnaI)
1. 5'- GCTATGTTTAAACCACGTGTTCGCGATCCGGGTGCCGATCCAGGCCTGCGATATCAGTACGTA
2. 3'- CGATACAAATTTGGTGCACAAGCGCTAGGCCCACGGCTAGGTCCGGACGCTATAGTCATGCAT
       AMFKPRVRDPGADPGLRYQYV
       (SEQ ID NO: 43)
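
A quick consistency check on a designed linker is to confirm that the two strands are complementary and that each intended restriction site is present in the expected order. The sketch below assumes the top strand reads GCGATATCAG at the EcoRV site (the reading implied by the bottom strand), uses the commonly documented recognition sequences for the listed enzymes, and represents BanI (GGYRCC) by the specific instance GGTGCC. All names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Top strand written 5'->3'; bottom strand written 3'->5' as in the listing.
TOP = "GCTATGTTTAAACCACGTGTTCGCGATCCGGGTGCCGATCCAGGCCTGCGATATCAGTACGTA"
BOTTOM_3_TO_5 = "CGATACAAATTTGGTGCACAAGCGCTAGGCCCACGGCTAGGTCCGGACGCTATAGTCATGCAT"

# Recognition sequences in the order the sites are labelled above the strands.
SITES = {
    "PmeI": "GTTTAAAC",
    "PmlI": "CACGTG",
    "NruI": "TCGCGA",
    "BanI": "GGTGCC",  # one instance of the degenerate site GGYRCC
    "StuI": "AGGCCT",
    "EcoRV": "GATATC",
    "SnaBI": "TACGTA",
}


def complement(sequence):
    """Base-by-base complement, without reversing the strand."""
    return sequence.translate(COMPLEMENT)


def site_positions(sequence, sites):
    """First 0-based position of each site in the sequence (-1 if absent)."""
    return {name: sequence.find(site) for name, site in sites.items()}


if __name__ == "__main__":
    print("strands complementary:", complement(TOP) == BOTTOM_3_TO_5)
    for name, position in site_positions(TOP, SITES).items():
        print(f"{name:6s} at {position}")
```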

The two oligonucleotide strands were annealed and
ligated with the DNA of plasmid pPT0235 which had been
digested with Eco47III and SnaI RENs. The products of this
ligation reaction were transformed into E. coli strain
HB101. Plasmid DNA from transformants was purified and
digested with EcoRI in combination with Eco47III or SnaI or
NruI RENs. Plasmid DNA from two clones that gave the correct
digestion pattern was sequenced. One plasmid, designated
pPT0285, was found to be correct and chosen for further
constructions.

One oligonucleotide strand coding for the CLP 3.7 gene
monomer (see Table 1) was synthesized using an Applied
Biosystems DNA synthesizer model 381A and a 2000 Å synthesis
column supplied by Glen Research. After the synthesis, the
226 base DNA segment was deprotected and cleaved from the
column support by treatment in NH4OH at 55°C for 6 hrs.

5'- ATGGCAGCGAAAGGGGACCGGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCAGGG
GCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCGGGTGCTCCGGGAACTCCTGGCCCGC
AGGGCTTGCCGGGATCCCCAGGTGCACCAGGAACGCCGGGACCTCAGGGTCTTCCGGGTAGCCCTGG
TGCCTTTCCGCTAAAGTCCTGCCGT -3' (SEQ ID NO:16)

Two additional DNA strands were synthesized to be used
as primers for PCR amplification. The two strands were:

5'- AAG AAG GAG ATA TCA TAT GGC AGC GAA AGG GGA CC -3' (SEQ ID NO:17)
5'- CGC AGA TCT TTA AAT TAG GGC AGG ACT TTA GCG GAA A -3' (SEQ ID NO:18)

The PCR reaction was then performed as previously
described. The amplified DNA was resuspended and digested
with BanI REN. The digested DNA was purified using a Probind
filter followed by a Bio-Spin column and then ligated with
pPT0285 previously digested with BanI REN and treated with
SAP. The products of the ligation reaction were transformed
into E. coli strain HB101. Plasmid DNA from transformants
was purified and analyzed as follows. Colonies were picked
and transferred onto a plate and into a 0.5 ml microfuge
tube containing 50 μl of lysis buffer (1% Tween 20, 10 μM
Tris-HCl pH 8.0, 1 mM EDTA). The tube was closed, incubated
at 55°C for 10 min and then cooled to room temperature. 5 μl
of lysate was added to 45 μl MasterMix (1x PCR buffer as
described previously, 5U AmpliTaq, 200 μM dNTPs) in a 0.5 ml
Perkin Elmer thin-walled GeneAmp reaction tube.
Amplification was performed in a Perkin Elmer DNA Thermal
cycler model 480 for 30 cycles with the following step cycle
of 1 min each: 95°C, 52°C, and 72°C. Aliquots from the
different reactions were analyzed by agarose gel
electrophoresis using 1.5% low melting point agarose in 0.5x
TAE buffer. Plasmid DNA from the clones showing the correct
size insert was purified and analyzed by DNA sequencing.

Plasmid pPT0310 contained the desired CLP 3.7 monomer
sequence (see Table 2).

TABLE 2 (SEQ ID NO: 19)

     BanI   AvaI/SmaI
5'- GGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCA
3'- CCACGGGGCCCATGAGGACCAGGTGTTCCAGACGGCCCTTCGGGT
    GAPGTPGPQGLPGSP

     BanII   GsuI   StuI   DraIII
    GGGGCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCG
    CCCCGAGGCCCATGAGGTCCAGGCGTTCCGGACGGCCCAAGTGGC
    GAPGTPGPQGLPGSP

     BglI   BamHI
    GGTGCTCCGGGAACTCCTGGCCCGCAGGGCTTGCCGGGATCCCCA
    CCACGAGGCCCTTGAGGACCGGGCGTCCCGAACGGCCCTAGGGGT
    GAPGTPGPQGLPGSP

     EcoO109I   BanI
    GGTGCACCAGGAACGCCGGGACCTCAGGGTCTTCCGGGTAGCCCTGGTGCC -3'
    CCACGTGGTCCTTGCGGCCCTGGAGTCCCAGAAGGCCCATCGGGACCACGG -5'
    GAPGTPGPQGLPGSP(G A)
    (SEQ ID NO: 44)



CLP 3.7 Polymer construction

Plasmid DNA from pPT0310 was digested with BanI REN and
the digestion fragments were separated by agarose gel
electrophoresis. The CLP 3.7 gene fragment, 130 bp, was
excised and purified by NACS column (see Methods). The
purified fragment was ligated with plasmid pSY1262 which had
been prepared as follows: pSY1262 plasmid DNA was digested
with BanI REN and subsequently treated with Shrimp Alkaline
Phosphatase (SAP) as described in Example 1.

The product of this ligation reaction was transformed
into E. coli strain HB101. Transformants were selected for
resistance to kanamycin. Plasmid DNA from individual
transformants was purified and analyzed for increased size
due to CLP 3.7 multiple DNA insertion. Several clones were
obtained and two of them containing inserts of approximately
1.25 kbp and 2.6 kbp (pPT0314 and pPT0312 respectively) were
chosen to be used for expression of CLP 3.7.

CLP 3.7 Analysis

E. coli strain HB101 containing plasmid pPT0312 or
pPT0314 was grown as described in Example 1. The proteins
produced by these cells were analyzed by SDS-PAGE for
detection of reactivity to CLP antibodies. In every analysis
a strong reactive band was observed with an apparent
molecular weight of 130 kD and 50 kD respectively.

pPT0312 CLP 3.7 837 AA MW 72,637

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
[(GAPGTPGPQGLPGSP)4]13
GAMDPGRYQLSAGRYHYQLVWCQK

(SEQ ID NO: 51)
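
The stated length of the pPT0312 product can be cross-checked by expanding the repeat notation. The sketch below assumes the leader reads MDPVVLQRR... (the "W" in the scanned listing being read as "VV") and that the repeat block is four copies of the 15 residue CLP unit repeated 13 times; both are readings that the quoted total of 837 residues supports, not independently confirmed facts. The names are hypothetical.

```python
LEADER = "MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM"  # assumed reading of the head sequence
CLP_UNIT = "GAPGTPGPQGLPGSP"                  # 15 residue collagen-like block
MONOMER = CLP_UNIT * 4                        # CLP 3.7 gene monomer product
TAIL = "GAMDPGRYQLSAGRYHYQLVWCQK"


def assemble(leader, monomer, repeats, tail):
    """Expand leader + [monomer] x repeats + tail into one amino acid string."""
    return leader + monomer * repeats + tail


protein = assemble(LEADER, MONOMER, 13, TAIL)

if __name__ == "__main__":
    print("leader:", len(LEADER), "repeat region:", len(MONOMER) * 13, "tail:", len(TAIL))
    print("total residues:", len(protein))
```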

15 

EXAMPLE 4 

Protein Polymer Adhesive Substrates (PPAS) 

PPAS polymers were designed to include oligopeptide
blocks of human fibrin gamma chain which contain either all
or part of the site of factor XIIIa cross-linking. The amino
acid sequences of Peptides 93.3 (SEQ ID NO: 11), 93.2 (SEQ ID
NO: 12), and 93.4 (SEQ ID NO: 13) were incorporated within a
structural backbone consisting of 3 complete repeats of a 15
amino acid peptide block of human collagen type I
(GAPGTPGPQGLPGSP (SEQ ID NO: 20), the CLP3.7 monomer
repeating amino acid sequence) and designated PPAS1-A,
PPAS1-B, and PPAS1-C, respectively.

A variety of structural backbones can be used in the
design of adhesive polymers with the option of changing the
physical properties of the polymer chain. The composition
of the backbone will affect the solubility of the polymer as
well as its rheological properties. CLP (collagen-like
protein) polymers are useful in this respect in that they
are extremely soluble in water, allowing protein solutions
of greater than 10 weight percent to be formed while still
maintaining good flow properties. CLP polymers have good
adhesion to hydrophilic surfaces such as glass and therefore
may adhere well to tissue. However, other backbones with
different properties such as SLP (silk-like protein), ELP
(elastin-like protein), KLP (keratin-like protein), or
copolymers of these could also supply useful properties.

The 17 amino acid fibrin block of Peptide 93.3 (SEQ ID
NO: 11) was integrated with the CLP3.7 monomer sequence (SEQ
ID NO: 20) so as to recreate its hydrophobicity and secondary
structure environment matching, as closely as possible, that
of human fibrin. Although this 17 amino acid block is the
C-terminal sequence of fibrin gamma chain (chain-A), a
variant gamma chain (chain-B) exists in the blood which
contains an additional 16 amino acids beyond the cross-linking
site. Because the gamma-B chain also participates
in factor XIIIa cross-linking, it follows that the 17 amino
acid sequence block does not have to be C terminal for
activity. Thus, it can be expected that protein polymers
consisting of tandem repeats of monomer blocks containing
the fibrin gamma factor XIIIa cross-linking sequence will
contain multiple, active sites of potential cross-linking.
Cross-linking may occur between polymer chains and with
neighboring tissue. During normal healing of the wound, the
protein based adhesive will be degraded and resorbed using
essentially the same mechanisms of proteolysis by which
normal blood clots are dissolved. The products of
degradation are peptides of essentially human sequence or
amino acids which may be reutilized by the body.



PPAS1-A gene monomer synthesis and construction

The PPAS1-A amino acid monomer sequence with the fibrin
gamma sequence shown in bold is as follows:

(GAPGTPGPQGLPGSP)3GAPGTPGEGQQHHLGGAKQAGDVGSP (SEQ ID NO: 21)

One oligonucleotide strand (see Table 3) was synthesized
using an Applied Biosystems DNA synthesizer model 381A and
a 2000 Å pore resin synthesis column supplied by Glen
Research. During the synthesis, the required interrupt-pause
steps for reagent bottle changes were minimized. After the
synthesis, the 123 base DNA fragment was deprotected and
cleaved from the column support by treatment in ammonium
hydroxide at 55°C for 6 hrs.

TABLE 3 (SEQ ID NO:22)
5'- ATGGCAGCGAAAGGGGACCGTGCACCAGGAACGCCGGGAGAAGGTCAACAGCACCATCTTGGTGG
AGCGAAACAGGCAGGCGACGTCGGTAGCCCTGGTGCCTTTCCGCTAAAGTCCTGCCGT -3'

The PCR reaction was then performed as previously
described using the same primers as were used in the
construction of the CLP3.7 monomer. The amplified DNA was
then resuspended and digested with ApaLI and DraI RENs. The
digested DNA was then purified using a Probind filter
followed by a Bio-Spin column and then ligated with pPT0310
previously digested with ApaLI and EcoRV RENs and purified
by NACS column. The products of the ligation reaction were
transformed into E. coli strain HB101. Plasmid DNA from
transformants was purified and analyzed by digestion using
EcoO109I, HincII and HindIII RENs. Plasmid DNA from the
clones showing the correct size insert was purified and
analyzed by DNA sequencing. Plasmid pPT0318 contained the
desired PPAS1-A gene monomer sequence (see Table 4).

TABLE 4 (SEQ ID NO:23)

     BanI   AvaI/SmaI
5'- GGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCA
3'- CCACGGGGCCCATGAGGACCAGGTGTTCCAGACGGCCCTTCGGGT
    GAPGTPGPQGLPGSP

     BanII   GsuI   StuI   DraIII
    GGGGCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCG
    CCCCGAGGCCCATGAGGTCCAGGCGTTCCGGACGGCCCAAGTGGC
    GAPGTPGPQGLPGSP

     BglI   BamHI
    GGTGCTCCGGGAACTCCTGGCCCGCAGGGCTTGCCGGGATCCCCA
    CCACGAGGCCCTTGAGGACCGGGCGTCCCGAACGGCCCTAGGGGT
    GAPGTPGPQGLPGSP

    GGTGCACCAGGAACGCCGGGAGAAGGTCAACAGCACCATCTTGGT
    CCACGTGGTCCTTGCGGCCCTCTTCCAGTTGTCGTGGTAGAACCA
    GAPGTPGEGQQHHLG

     AatII   BanI
    GGAGCGAAACAGGCAGGCGACGTCGGTAGCCCTGGTGCC -3'
    CCTCGCTTTGTCCGTCCGCTGCAGCCATCGGGACCACGG -5'
    GAKQAGDVGSP(G A) (SEQ ID NO:45)
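
Each block in these tables pairs a coding strand with its translation, so the amino acid line can be checked by translating the top strand with the standard genetic code. The sketch below is a generic illustration, not the procedure used in the patent; the codon table is built in the conventional T, C, A, G order and the names are hypothetical. The example input is the fourth block of the PPAS1-A monomer, which carries the start of the fibrin gamma sequence.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding strand (5'->3') in frame 1; ignores a trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    block_4 = "GGTGCACCAGGAACGCCGGGAGAAGGTCAACAGCACCATCTTGGT"
    print(translate(block_4))
```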

5 

Construction of expression plasmid pPT0317

Plasmid DNA pSY1262 was linearized with PvuII REN, then
passed through a Probind filter followed by a Bio-Spin
column. The DNA was then treated with SAP and ligated with
a DNA fragment from pQE-17 (QIAGEN Catalog #33173) prepared
as follows. Plasmid DNA pQE-17 was digested with BglII and
HindIII RENs and the 36 bp fragment (see Table 5) was
purified using a Probind filter and then a Bio-Spin column.
The DNA was purified further using a Microcon-30 filter and
the filtrate containing the 36 bp was kept. The DNA was then
treated with DNA Polymerase I and purified through a Probind
filter and then a Bio-Spin column.



TABLE 5 

5'- GATCTTCGATCTCATCACCATCACCATCACTA (SEQ ID NO: 24)

3'- AAGCTAGACTAGTGCTAGTCGTAGTGATTCGA (SEQ ID NO: 25) 



The products of the ligation reaction were transformed
into E. coli strain HB101. Plasmid DNA from transformants
was purified and analyzed by digestion using BstYI and
Bst1107I RENs. Plasmid DNA from the clones showing the
correct restriction pattern was purified and analyzed by DNA
sequencing. Plasmid pPT0317 contained the desired DNA insert
and was used for further DNA manipulations.

30 

PPAS1-A polymer construction

Plasmid DNA from pPT0318 was digested with BanI REN and
the digestion fragments were separated by agarose gel
electrophoresis. The PPAS1-A gene fragment, 216 bp, was
excised and purified using the Ultrafree-MC filter. The
purified fragment was ligated with plasmid pPT0317 which had
been prepared as follows. Plasmid DNA pPT0317 was digested
with BanI REN, then passed through a Probind filter and then
a Bio-Spin column. The DNA was then treated with SAP.

The products of the ligation reaction were transformed
into E. coli strain HB101. Transformants were selected for
resistance to kanamycin. Plasmid DNA from individual
transformants was purified and analyzed using EcoRI and
EcoRV RENs for the presence of PPAS1-A multimer gene
inserts. Several clones were obtained with insert sizes
ranging from 200 bp to approximately 4 kb. Several clones
containing from 10 to 20 repeats were chosen for use in
expression of the PPAS1-A polymer.

10 

PPAS1-A expression analysis

E. coli strain HB101 containing plasmid pPT0321,
pPT0325, pPT0326, or pPT0327 was cultured as previously
described. The proteins produced by these cells showed
strong reactive bands of apparent molecular weights ranging
from 80 kD to 180 kD when analyzed by western blot for
reactivity to CLP antibody. One clone, pPT0321, containing
10 repeats of the PPAS1-A monomer was selected for further
study.

pPT0321 (SEQ ID NO: 26) PPAS1-A 762 AA MW 68,056

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
[(GAPGTPGPQGLPGSP)3 GAPGTPGEGQQHHLGGAKQAGDVGSP]10
GAMDPGRYQDLRSHHHHHH

PPAS1-B gene monomer synthesis and construction

The PPAS1-B amino acid monomer sequence with the fibrin
gamma sequence shown in bold is as follows:

(GAPGTPGPQGLPGSP)3 GAPGTPGEGQQHHLGGARQAGDVGSP (SEQ ID NO:27)

Two oligonucleotide strands (see Table 6) were
synthesized and purified as previously described.

35 

TABLE 6

1. 5'- GTGGAGCTCGCCAGGCAGGCGACGT (SEQ ID NO:28)
2. 3'- GAACCACCTCGAGCGGTCCGTCCGC (SEQ ID NO:29)

These oligonucleotide strands were annealed and ligated
with plasmid pPT0318 which had been digested with BstXI and
AatII RENs. The products of this ligation reaction were
transformed into E. coli strain HB101. Plasmid DNA from
transformants was purified and digested with NcoI and SacI
RENs to determine whether they had the correct restriction
pattern. Plasmid DNA from correct clones was sequenced.
Plasmid pPT0320 (shown in Table 7) contained the desired
PPAS1-B monomer sequence.



TABLE 7 (SEQ ID NO: 30)

     BanI   AvaI/SmaI
5'- GGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCA
3'- CCACGGGGCCCATGAGGACCAGGTGTTCCAGACGGCCCTTCGGGT
    GAPGTPGPQGLPGSP

     BanII   GsuI   StuI   DraIII
    GGGGCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCG
    CCCCGAGGCCCATGAGGTCCAGGCGTTCCGGACGGCCCAAGTGGC
    GAPGTPGPQGLPGSP

     BglI   BamHI
    GGTGCTCCGGGAACTCCTGGCCCGCAGGGCTTGCCGGGATCCCCA
    CCACGAGGCCCTTGAGGACCGGGCGTCCCGAACGGCCCTAGGGGT
    GAPGTPGPQGLPGSP

    GGTGCACCAGGAACGCCGGGAGAAGGTCAACAGCACCATCTTGGT
    CCACGTGGTCCTTGCGGCCCTCTTCCAGTTGTCGTGGTAGAACCA
    GAPGTPGEGQQHHLG

     AatII   BanI
    GGAGCTCGCCAGGCAGGCGACGTCGGTAGCCCTGGTGCC -3'
    CCTCGAGCGGTCCGTCCGCTGCAGCCATCGGGACCACGG -5'
    GARQAGDVGSP(G A) (SEQ ID NO:46)

35 

PPAS1-B polymer construction

Plasmid DNA from pPT0320 was digested with BanI REN and
the digestion fragments were separated by agarose gel
electrophoresis. The PPAS1-B gene fragment, 216 bp, was
excised and purified using an Ultrafree-MC filter. The
purified fragment was ligated with plasmid pPT0317 prepared
as described above.

The products of this ligation reaction were transformed
into E. coli strain HB101. Transformants were selected for
resistance to kanamycin. Plasmid DNA from individual
transformants was purified and analyzed using EcoRI and
EcoRV RENs for DNA inserts containing multimers of the
PPAS1-B gene monomer. Several clones were obtained
containing inserts up to 5 kb in size.

PPAS1-B expression analysis

E. coli strain HB101 containing plasmid pPT0324,
containing 10 repeats of the PPAS1-B monomer sequence, was
cultured as previously described. The proteins produced by
these cells were analyzed by western blot for reactivity to
CLP antibody. A strong reactive band was observed with an
apparent molecular weight of approximately 90 kD.

pPT0324 (SEQ ID NO: 31) PPAS1-B 762 AA MW 68,336

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
[(GAPGTPGPQGLPGSP)3 GAPGTPGEGQQHHLGGARQAGDVGSP]10
GAMDPGRYQDLRSHHHHHH

PPAS1-C gene monomer synthesis and construction

The PPAS1-C amino acid monomer sequence with the fibrin
gamma sequence shown in bold is as follows:

(GAPGTPGPQGLPGSP)3 GAPGTPGGAKQAGDVGSP (SEQ ID NO: 32)

Two oligonucleotide strands (SEQ ID NOS: 33-34) (see Table
8) were synthesized and purified as previously described.

TABLE 8

5'- TGCACCAGGAACGCCGGGAGGTGCTAAACAAGCAGGAGACGTCGGTAGCCCTGGTGCCTTT
3'-     GGTCCTTGCGGCCCTCCACGATTTGTTCGTCCTCTGCAGCCATCGGGACCACGGAAA

These oligonucleotide strands were annealed and ligated
with plasmid pPT0310 which had been digested with ApaLI and
EcoRV RENs. The products of this ligation reaction were
transformed into E. coli strain HB101. Plasmid DNA from
transformants was purified and digested with BsaHI and
HindIII RENs to determine their restriction pattern. Plasmid
DNA from correct clones was sequenced. Plasmid pPT0319
(shown in Table 9) contained the desired PPAS1-C gene
monomer sequence.

5 

TABLE 9 (SEQ ID NO: 35)

     BanI   AvaI/SmaI
5'- GGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCA
3'- CCACGGGGCCCATGAGGACCAGGTGTTCCAGACGGCCCTTCGGGT
    GAPGTPGPQGLPGSP

     BanII   GsuI   StuI   DraIII
    GGGGCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCG
    CCCCGAGGCCCATGAGGTCCAGGCGTTCCGGACGGCCCAAGTGGC
    GAPGTPGPQGLPGSP

     BglI   BamHI
    GGTGCTCCGGGAACTCCTGGCCCGCAGGGCTTGCCGGGATCCCCA
    CCACGAGGCCCTTGAGGACCGGGCGTCCCGAACGGCCCTAGGGGT
    GAPGTPGPQGLPGSP

     ApaLI
    GGTGCACCAGGAACGCCGGGAGGTGCTAAACAAGCAGGAGACGTC
    CCACGTGGTCCTTGCGGCCCTCCACGATTTGTTCGTCCTCTGCAG
    GAPGTPGGAKQAGDV

     BanI
    GGTAGCCCTGGTGCC -3'
    CCATCGGGACCACGG -5'
    G S P (G A) (SEQ ID NO:47)



PPAS1-C polymer construction

Plasmid DNA from pPT0319 was digested with BanI REN and
the digestion fragments were separated by agarose gel
electrophoresis. The PPAS1-C gene fragment, 192 bp, was
excised and purified using an Ultrafree-MC filter. The
purified fragment was ligated with plasmid pPT0317 which had
been prepared as described above. The products of this
ligation reaction were transformed into E. coli strain
HB101. Transformants were selected for resistance to
kanamycin. Plasmid DNA from individual transformants was
purified and analyzed using EcoRI and EcoRV RENs for DNA
inserts containing multimers of PPAS1-C gene monomer.
Several clones were obtained and one of them, pPT0322,
containing an insert of approximately 2 kb, containing 10
repeats of the PPAS1-C gene monomer, was chosen for
expression analysis.
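
The number of repeats in a clone can be related to its insert size by counting codons. The sketch below is a rough estimate only: it takes the PPAS1-C monomer amino acid sequence from the text, assumes three bases per residue with no allowance for flanking vector sequence, and uses hypothetical names.

```python
CLP_UNIT = "GAPGTPGPQGLPGSP"
PPAS1_C_MONOMER = CLP_UNIT * 3 + "GAPGTPGGAKQAGDVGSP"  # from the monomer sequence in the text


def coding_length_bp(protein_sequence, repeats=1):
    """Bases needed to encode `repeats` tandem copies of a protein sequence."""
    return len(protein_sequence) * 3 * repeats


def repeats_from_insert(insert_bp, protein_sequence):
    """Approximate repeat count for an insert of a given size, rounded to nearest."""
    return round(insert_bp / coding_length_bp(protein_sequence))


insert_bp = coding_length_bp(PPAS1_C_MONOMER, repeats=10)

if __name__ == "__main__":
    print("monomer residues:", len(PPAS1_C_MONOMER))
    print("coding length for 10 repeats:", insert_bp, "bp")
    print("repeats estimated from a 2 kb insert:", repeats_from_insert(2000, PPAS1_C_MONOMER))
```

Ten repeats of the 63 residue monomer account for a little under 2 kb of coding sequence, in line with the insert size reported for pPT0322.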

PPAS1-C expression analysis

E. coli strain HB101 containing plasmid pPT0322 was
cultured as previously described. The proteins produced by
these cells were analyzed by western blot for reactivity with
CLP antibody. A strong reactive band was observed with an
apparent molecular weight of approximately 80 kD.

pPT0322 (SEQ ID NO: 36) PPAS1-C 682 AA MW 59,192

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
[(GAPGTPGPQGLPGSP)3 GAPGTPGGAKQAGDVGSP]10
GAMDPGRYQDLRSHHHHHH

PPAS1-D gene monomer synthesis and construction

The PPAS1-D amino acid monomer sequence with the fibrin
gamma POLSITE sequence shown in bold is as follows:

(GAPGTPGPQGLPGSP)2GA TRWYSMKKTTMKI IPFNRLTI GEGQQHHLGGARQAGDV GSP
(SEQ ID NO: 38)

One oligonucleotide strand coding for the POLSITE
portion of the gene monomer (see Table 10) was synthesized
using an Applied Biosystems DNA synthesizer model 381A and
a 2000 Å synthesis column supplied by Glen Research. During
the synthesis, the required interrupt-pause steps for
reagent bottle changes were minimized. After the synthesis,
the 126 base DNA strand was deprotected and cleaved from the
column support by treatment in ammonium hydroxide at 55°C
for 6 hrs.

TABLE 10 (SEQ ID NO:39)

5'- ATGGCAGCGAAAGGGGACCACCGGGTGCTACCCGTTGGTATTCTATGAAAAAGACTACCATGAA
AATCATTCCGTTTAACCGCCTGACCATTGGCGAAGGTCAACTTTCCGCTAAAGTCCTGCCGT -3'

The PCR reaction was then performed as previously
described using the same primers as were used in the
construction of the CLP3.7 monomer. The DNA was resuspended
and digested with DraIII and HincII RENs and the digested
DNA was purified using a Probind filter followed by a Bio-Spin
column and then ligated with pPT0320 previously
digested with DraIII and HincII RENs and purified with a
Probind filter followed by a Bio-Spin column. The products
of the ligation reaction were transformed into E. coli
strain HB101. Plasmid DNA from transformants was purified
and digested with NlaIII and plasmids giving the correct
restriction pattern were sequenced. A plasmid containing
the desired PPAS1-D monomer sequence, pPT0328 (see Table
11), was used for further DNA constructions.

TABLE 11 (SEQ ID NO:52)

    BanI  AvaI/SmaI
5'- GGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCA
3'- CCACGGGGCCCATGAGGACCAGGTGTTCCAGACGGCCCTTCGGGT
    GAPGTPGPQGLPGSP

    BanII  GsuI  StuI  DraIII
    GGGGCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCG
    CCCCGAGGCCCATGAGGTCCAGGCGTTCCGGACGGCCCAAGTGGC
    GAPGTPGPQGLPGSP

    GGTGCTACCCGTTGGTATTCTATGAAAAAGACTACCATGAAAATC
    CCACGATGGGCAACCATAAGATACTTTTTCTGATGGTACTTTTAG
    GATRWYSMKKTTMKI

    HincII
    ATTCCGTTTAACCGCCTGACCATTGGCGAAGGTCAACAGCACCAT
    TAAGGCAAATTGGCGGACTGGTAACCGCTTCCAGTTGTCGTGGTA
    IPFNRLTIGEGQQHH

    AatII  BanI
    CTTGGTGGAGCTCGCCAGGCAGGCGACGTCGGTAGCCCTGGTGCC -3'
    GAACCACCTCGAGCGGTCCGTCCGCTGCAGCCATCGGGACCACGG -5'
    LGGARQAGDVGSP(GA) (SEQ ID NO:48)
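
The reading frame of this monomer and the size of its BanI fragment can be checked mechanically. The following is a minimal Python sketch, assuming the top strand of Table 11 as transcribed here and the standard genetic code. The names `MONOMER`, `translate` and `BANI` are illustrative only and are not part of the original disclosure.

```python
import re

# Top strand of the PPAS1-D gene monomer, 5' to 3', as read from Table 11.
MONOMER = (
    "GGTGCCCCGGGTACTCCTGGTCCACAAGGTCTGCCGGGAAGCCCA"
    "GGGGCTCCGGGTACTCCAGGTCCGCAAGGCCTGCCGGGTTCACCG"
    "GGTGCTACCCGTTGGTATTCTATGAAAAAGACTACCATGAAAATC"
    "ATTCCGTTTAACCGCCTGACCATTGGCGAAGGTCAACAGCACCAT"
    "CTTGGTGGAGCTCGCCAGGCAGGCGACGTCGGTAGCCCTGGTGCC"
)

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = {a + b + c: AMINO[16 * i + 4 * j + k]
          for i, a in enumerate(BASES)
          for j, b in enumerate(BASES)
          for k, c in enumerate(BASES)}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring a trailing partial codon."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# BanI recognizes GGYRCC (Y = C or T, R = A or G).
BANI = re.compile("GG[CT][AG]CC")
sites = [m.start() for m in BANI.finditer(MONOMER)]

protein = translate(MONOMER)
fragment_bp = sites[-1] - sites[0]   # distance between the two BanI sites
print(protein)
print(sites, fragment_bp)
```

Under these assumptions the two BanI sites sit at the ends of the monomer, and the distance between them corresponds to the 219 bp gene fragment excised in the polymer construction.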



PPAS1-D polymer construction

pPT0328 plasmid DNA containing the gene monomer coding
for PPAS1-D is digested with BanI REN and the digestion
fragments are separated by agarose gel electrophoresis. The
PPAS1-D gene fragment, 219 bp, is excised and purified using
an Ultrafree-MC filter. The purified fragment is ligated
with plasmid pPT0317 prepared as described above. The
products of this ligation reaction are transformed into E.
coli strain HB101. Transformants are selected for
resistance to kanamycin. Plasmid DNA from individual
transformants is purified and analyzed using EcoRI and EcoRV
RENs for DNA inserts containing multimers of the PPAS1-D
gene monomer. An appropriate clone containing a DNA insert
from 1 to 4 kb is analyzed for PPAS1-D polymer expression.

PPAS1-D protein polymer sequence:

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPM

[(GAPGTPGPQGLPGSP)2 GATRWYSMKKTTMKIIPFNRLTIGEGQQHHLGGARQAGDVGSP]n
GAMDPGRYQDLRSHHHHHH (SEQ ID NO: 53)

Where n = 2-20
EXAMPLE 5 

PPAS1-A Activity Assays

E. coli strain pPT0321 containing the PPAS1-A polymer
gene was produced by fermentation. The product was purified
from the cellular biomass by means of cellular lysis,
clearance of insoluble debris by centrifugation, and
affinity chromatography. The purified product was analyzed
by sodium dodecyl sulfate polyacrylamide gel
electrophoresis (SDS-PAGE), immunoreactivity with CLP
antibody, and amino acid analysis. A protein band of
apparent molecular weight 85,000 was observed by amido black
staining of SDS-PAGE separated and transferred samples and
the same band reacted with the CLP antibody on western
blots. As expected, amino acid analysis indicated that the
product was enriched for the amino acids glycine (34.3%),
alanine (7.3%), proline (28.5%), and glutamine (7.0%). The
amino acid composition (see Table 12) shows the correlation
between the composition of the purified product and the
expected theoretical composition as deduced from the
synthetic gene sequence.

TABLE 12

Amino Acid Analysis of Purified PPAS1-A

                          ACTUAL          THEORETICAL
Amino Acid    pmoles   % composition    % composition

Ala            73.74       7.3              8.4
Asx            25.57       2.5              2.3
Glx            71.11       7.0              9.7
Phe             1.48       0.15             0.1
Gly           346.27      34.3             30.5
His            31.69       3.14             3.5
Ile             0          0                0
Lys             9.10       0.9              1.3
Leu            46.20       4.6              5.8
Met             3.83       0.38             0.4
Pro           288.16      28.5             24.4
Arg             4.27       0.42             0.7
Ser            48.07       4.8              5.5
Thr            47.84       4.7              5.4
Val            12.61       1.3              1.7
Tyr             0          0                0.1
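
The actual percent composition in Table 12 follows directly from the pmole column: each value is that residue's share of the total pmoles recovered. The following is a minimal Python sketch of that arithmetic, using only the pmole figures listed in the table. The function name `percent_composition` is illustrative.

```python
# pmoles recovered per residue class, as listed in Table 12.
PMOLES = {
    "Ala": 73.74, "Asx": 25.57, "Glx": 71.11, "Phe": 1.48,
    "Gly": 346.27, "His": 31.69, "Ile": 0.0, "Lys": 9.10,
    "Leu": 46.20, "Met": 3.83, "Pro": 288.16, "Arg": 4.27,
    "Ser": 48.07, "Thr": 47.84, "Val": 12.61, "Tyr": 0.0,
}


def percent_composition(pmoles):
    """Return each residue's share of the total recovered pmoles, in percent."""
    total = sum(pmoles.values())
    return {aa: 100.0 * amount / total for aa, amount in pmoles.items()}


actual = percent_composition(PMOLES)
for aa in ("Gly", "Pro", "Ala", "Glx"):
    print(f"{aa}: {actual[aa]:.1f}%")
```

The four residues printed are the ones the text singles out as enriched, so the output can be compared line by line with the percentages quoted there.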

Purified PPAS1-A was analyzed for its ability to serve
as a substrate for the blood clotting enzyme, factor XIIIa.
The tests were run in two ways. A plate assay was conducted
in which PPAS1-A protein was coated onto the wells of a
standard 96-well microtiter plate. Dilutions of PPAS1-A
solution were applied into individual wells and allowed to
stand overnight at 4 degrees centigrade. Adjacent wells
were similarly coated with diluted solutions of the protein
β-casein (bovine milk protein, a known substrate for factor
XIIIa) or left uncoated as negative controls. After excess
coating solution was washed from the wells, a solution
containing both factor XIII and thrombin which had been
preincubated in order to achieve activation of the factor
XIII was applied to each well. Some wells remained free of
enzyme solution and served as background controls.

A buffer containing the compound
5-biotinamidopentylamine (BAPA) was also added to each well.
BAPA is a substrate analog for factor XIIIa which becomes
bonded to a glutamine containing substrate protein via
transglutaminase activity. The reaction of BAPA with
β-casein is known to be factor XIIIa dependent. All wells
were incubated at 37 degrees centigrade. The wells were
washed several times to remove unreacted substrates and
enzymes, and filled with a solution containing streptavidin
conjugated horse radish peroxidase (streptavidin binds with
high affinity to BAPA). The wells were again washed and a
solution containing the chromogenic substrate for horse
radish peroxidase (HRP) was added. Upon incubation at room
temperature, wells began to turn blue. The reaction was
stopped by adding 0.1 N oxalic acid and the degree of color
was quantified by absorbance of light at a wavelength of 410
nm. A color reaction was seen in wells coated with β-casein
with the greater color corresponding to the greater
concentration of β-casein in the coating solution. Wells
containing PPAS1-A also produced a color reaction which
increased in intensity with greater PPAS1-A coating
concentrations. The PPAS1-A color reaction was dependent on
the presence of factor XIIIa, as evidenced from the absence
of color in wells lacking factor XIIIa.

A similar assay was run to confirm that the product
responsible for the color reaction in the plate assay was
indeed PPAS1-A protein. Reactions were conducted in test
tubes containing PPAS1-A protein, activated factor XIII,
BAPA, and buffer solution. Similar reactions were conducted
also with β-casein protein as a control. After incubation
at 37 degrees centigrade, samples of the reactions were
treated with detergent solution and heated to 100 degrees
centigrade for 5 minutes, loaded and electrophoresed on
SDS-PAGE gels and transferred to filters. Identical filters
were either reacted with streptavidin conjugated horse
radish peroxidase or with CLP antibody. The antibody
reacted filters were subsequently reacted with horse radish
peroxidase conjugated goat anti-immunoglobulin antibody.
Both filters were exposed to chemiluminescent reagent
substrate for HRP and exposed to X-ray film. Luminescent
bands were observed on one panel where BAPA conjugated
proteins resided. The β-casein lane contained a band of
approximately 24,000 daltons (the expected molecular weight
for β-casein). The PPAS1-A lanes contained the polymer
bands which correlated with the molecular weight of bands
observed on the filter reacted with anti-CLP antibody. The
reactivity of these bands was not observed in lanes loaded
with reactions in which factor XIIIa had been omitted.

These data indicate that PPAS1-A serves as a substrate
for the blood clotting factor XIIIa. The activity observed
is consistent with the creation of a covalent bond between
PPAS1-A and the substrate analog BAPA. The natural activity
of factor XIIIa cross-links two fibrin protein chains by
creating an isopeptide bond between a glutamine residue on
one chain and a lysine residue on the other. By
incorporating the fibrin oligopeptide block containing the
active glutamine residue within the PPAS1-A chain, a
synthetic protein substrate for factor XIIIa was created. In
the presence of BAPA or other compounds, such as proteins,
which contain a reactive primary amino group equivalent to
lysine, factor XIIIa will cause the linkage of such
compounds with PPAS1-A. The activity of such polymers,
whether produced by traditional chemical synthesis or
recombinant means, and factor XIIIa has utility as an
adhesive, sealant, or bonding agent. They may be used in
the creation of cross-linked hydrogel materials which can
encapsulate live cells, tissues or organs. They may be used
to incorporate small molecules or active agents to proteins
through nonhydrolyzing but proteolytically susceptible
linkages. This chemistry can be used to attach
pharmaceuticals to resorbable protein matrices for use in
drug delivery.

EXAMPLE 6 

Construction of PPAS1-F and PPAS1-G

Plasmid DNA pPT312 was linearized with PvuII REN, then
passed through a Millipore Probind filter. The DNA was then
treated with SAP. The linearized pPT312 DNA was then
ligated with a DNA fragment from pQE-17 (QIAGEN Catalog
#33173) prepared as follows. Plasmid DNA pQE-17 was
digested with BglII and HindIII RENs and the 36 bp fragment
(depicted in Table 5, above) was purified using Probind and
Biospin columns as described above. The DNA was purified
further using a Microcon 30 column by centrifuging as
described above and the filtrate, containing the 36 bp
fragment, was kept. The DNA was then treated with DNA
Polymerase I and purified through a Probind and then a
Biospin column.

The products of the ligation reaction were transformed
into E. coli strain HB101. Plasmid DNA from transformants
was purified and analyzed by digestion using Bst1107I and
EcoRV RENs. Clones containing the desired DNA fragment were
further digested with Bst1107I and BstYI RENs to determine
the orientation of the insert. Plasmid DNA from the clones
showing the correct restriction pattern was purified and
analyzed by DNA sequencing. Plasmid pPT0337 contained the
desired DNA insert and was used for further DNA
manipulation.

Plasmid DNA pPT0337 was digested with XcmI REN, followed
by Mung Bean Nuclease treatment for 30 min. at 37°C. The DNA
was then purified using Probind and Biospin and then treated
with SAP followed by Probind and Biospin column
purification.

The PCR amplified DNA, coding for the PPAS1-A (Table 1),
was digested with ApaLI and BglII RENs, and the fragments
were purified by agarose gel electrophoresis followed by
Ultrafree MC gel purification. The DNA was then treated with
DNA Polymerase I Klenow fragment and then purified using a
Probind column followed by a Biospin column as described
previously. The DNA was then ligated with pPT0337.

The products of the ligation reaction were transformed
into E. coli strain HB101. Plasmid DNA from transformants
was purified and analyzed by digestion using EcoRI and DraI
RENs. Plasmid pPT0334 contained the desired insert and was
used for subsequent constructions.

The PCR amplified DNA coding for PPAS1-A was again
digested with EcoRV REN, then the enzyme was removed with a
Probind column followed by treatment with BsaJI REN, then
purified by Probind and Biospin columns and concentrated in
vacuo. The DNA was treated with DNA Polymerase Klenow
fragment followed by Probind; the DNA fragments were
purified by agarose gel electrophoresis followed by
Ultrafree MC gel purification, and concentrated in vacuo
followed by Biospin. The DNA was then ligated with plasmid
DNA pPT334 previously digested with EcoRV REN followed by
Probind and Biospin and then treated with SAP followed by
Probind and Biospin columns.

The products of the ligation reaction were transformed
into E. coli strain HB101. Plasmid DNA from transformants
was purified and analyzed by digestion using BstXI REN. The
clones containing the desired DNA fragment were further
digested with AccI and EcoRV RENs to determine the
orientation of the insert. Plasmid pPT0338 contained the DNA
fragment in the correct orientation and was used for
subsequent constructions.

Plasmid DNA pPT0338 was digested with BanI REN and the
digestion fragments were separated by agarose gel
electrophoresis; the DNA was excised and self-ligated. The
products of the ligation mixture were transformed into
E. coli strain HB101. Plasmid DNA from transformants was
purified and analyzed by digestion using BamHI and Bst1107I
RENs. Plasmid pPT0339 contained the desired deletion and was
used for subsequent constructions.

Plasmid DNA pPT0339 was digested with BanI REN, followed
by Probind and Biospin and then treated with SAP followed by
Probind and Biospin columns. The plasmid DNA so treated was
ligated with the CLP gene fragments from pPT0312. Plasmid
DNA pPT0312 was digested with BanI REN and the CLP gene
fragments purified by agarose gel electrophoresis followed
by NACS and Biospin columns.

The products of this ligation reaction were transformed
into E. coli strain HB101. Transformants were selected for
resistance to kanamycin. Plasmid DNA from individual
transformants was purified and analyzed for DNA inserts
containing multimers of PPAS1-F gene monomers. Several
clones were obtained and one of them (pPT0348) containing
an insert of approximately 2.2 kb (12 repeats of the CLP
3.7 gene monomer) was chosen for expression analysis.

PPAS1-F Expression

An overnight culture of E. coli strain HB101 containing
plasmid pPT0348 grown at 30°C was used to inoculate 50 ml of
LB media contained in a 250 ml flask. Kanamycin was added
at a final concentration of 50 μg per ml and the culture was
incubated with agitation (200 rpm) at 30°C. When the culture
reached an OD600 of 0.8, 40 ml were transferred to a new
flask prewarmed at 42°C and incubated at the same
temperature for approximately 2 hours. The cultures (30° and
42°) were chilled on ice and OD600 was taken. Cells were
collected by centrifugation and divided in 1.0 OD600
aliquots. The proteins produced by these cells were
analyzed by western blot reactivity with anti-CLP antibody.
A strong reactive band was observed with an apparent
molecular weight of approximately 94 kD. The expected amino
acid sequence of the PPAS1-F polymer encoded by plasmid
pPT0348 is shown below.

pPT0348 (SEQ ID NO: 54) PPAS1-F 829 AA MW 72,437

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDAPGTPGEGQQHHLGGAKQAGDVGSP
(GAPGTPGPQGLPGSP)48
GAMDPGRYHMAAKGDRAPGTPGEGQQHHLGGAKQAGDVGSPDQDLRSHHHHHH

The fibrin gamma chain sequence is shown in bold.
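
The stated length of 829 residues gives a way to cross-check a reading of this listing. The following is a minimal Python sketch under explicit assumptions: the leader is read as beginning MDPVV, the CLP block repeats 48 times (12 copies of a four-block CLP 3.7 monomer), and the fibrin gamma segment is taken to be GEGQQHHLGGAKQAGDV. None of these readings is certain from the printed text, and the names used are illustrative.

```python
# Assumed reading of the PPAS1-F listing (see the lead-in for what is assumed).
HEAD = "MDPVVLQRRDWENPGVTQLNRLAAHPPFASDAPGTPGEGQQHHLGGAKQAGDVGSP"
REPEAT = "GAPGTPGPQGLPGSP"
TAIL = "GAMDPGRYHMAAKGDRAPGTPGEGQQHHLGGAKQAGDVGSPDQDLRSHHHHHH"

# Assumed extent of the fibrin gamma chain segment carrying the reactive site.
FIBRIN_GAMMA = "GEGQQHHLGGAKQAGDV"


def build_polymer(n_repeats):
    """Assemble head, n copies of the 15-residue CLP block, and tail."""
    return HEAD + REPEAT * n_repeats + TAIL


ppas1_f = build_polymer(48)
print(len(ppas1_f))                  # total residues under these assumptions
print(ppas1_f.count(FIBRIN_GAMMA))   # copies of the fibrin gamma segment
```

With this reading the assembled chain matches the stated residue count and carries one fibrin gamma segment near each terminus.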

Construction of PPAS1-G

Plasmid DNA pPT0339, prepared as described in the PPAS1-F
construction, was ligated with the SELP8 gene fragments from
pPT0289. Plasmid DNA pPT0289 (described below) was digested
with BanI REN and the SELP8 gene fragments were purified by
agarose gel electrophoresis followed by NACS and Biospin
columns.

The products of this ligation reaction were transformed
into E. coli strain HB101. Transformants were selected for
resistance to kanamycin. Plasmid DNA from individual
transformants was purified and analyzed for DNA inserts
containing multimers of PPAS1-G gene monomers. Several
clones were obtained and one of them, pPT0349, containing
an insert of approximately 2.4 kb (12 repeats of the SELP8
gene monomer) was chosen for expression analysis.

PPAS1-G Expression

An overnight culture of E. coli strain HB101 containing
plasmid pPT0349 grown at 30°C was used to inoculate 50 ml of
LB media contained in a 250 ml flask. Kanamycin was added
at a final concentration of 50 μg per ml and the culture was
incubated with agitation (200 rpm) at 30°C. When the culture
reached an OD600 of 0.8, 40 ml were transferred to a new
flask prewarmed at 42°C and incubated at the same
temperature for approximately 2 hours. The cultures (30° and
42°) were chilled on ice and OD600 was taken. Cells were
collected by centrifugation and divided in 1.0 OD600
aliquots. The proteins produced by these cells were analyzed
by western blot reactivity with anti-SLP antibody. A strong
reactive band was observed with an apparent molecular weight
of approximately 94 kD. The expected amino acid sequence
of the PPAS1-G polymer encoded by plasmid pPT0349 is shown
below.



pPT0349 (SEQ ID NO: 55) PPAS1-G 877 AA MW 69,941

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDAPGTPGEGQQHHLGGAKQAGDVGSPGAGSGAGAGS
((GVGVP) .(GAGAGS) ^ )„ 
(GVGVP) ,(GAGAGS)3 
GAGAMDPGRYHMAAKGDRAPGTPGEGQQHHLGGAKQAGDVGSPDQDLRSHHHHHH

The fibrin gamma chain sequence is shown in bold.

Protein Polymers as Factor XIII Substrates

PPAS 1-F and PPAS 1-G were determined to be substrates for
Factor XIIIa through the use of the Fluorescence Enhancement
Assay. Purified lyophilized samples of each polymer were
resuspended to 20 mg/ml in reaction buffer (100 mM Tris-HCl
pH 7.5, 30 mM NaCl, 1 mM EDTA), from which a 40 μl aliquot
was dispensed into a glass test tube. Added to this sample
were 120 μl of monodansyl cadaverine (MDC) mix [consisting
of 0.55 mg/ml MDC, 73.3 mM Tris-HCl pH 7.5, 40 mM DTT, 18 mM
NaCl, and 0.6 mM ethylenediaminetetraacetic acid (EDTA)],
200 μl activated Factor XIII (prepared by incubating 10 μl
Factor XIII enzyme preparation with: 146 μl 100 mM Tris-HCl
pH 7.5, 30 mM NaCl, 1 mM EDTA; 2.2 μl 100 mM dithiothreitol
(DTT); 4 μl Thrombin (1.0 EU/ml) (Calbiochem Cat. #605195)
at 37°C for one hour), and reaction buffer to bring the
reaction volume to 1.6 ml. Factor XIII was purified
according to the procedure of C. G. Curtis and L. Lorand
(Methods in Enzymology 1976, Volume 11, p. 177).

The reaction was incubated at 37°C and progress
monitored with periodic fluorescence measurements on a
Sequoia-Turner model 450 fluorometer blanked against a
reaction without Factor XIIIa. The PPAS 1-F and PPAS 1-G
proteins, which differ only in the intervening polymer
sequences between the N-terminal and C-terminal Factor XIIIa
sequences, gave comparable fluorescence readings. After 24
hours of incubation, PPAS 1-F reached a plateau value of 950
FEU and PPAS 1-G of 1125 FEU.

Aliquots of the above reactions were boiled in protein
loading buffer for 5 minutes and electrophoresed on an 8%
SDS-PAGE gel. Upon separation of the proteins, the gel was
illuminated with an ultraviolet lamp and the polymer bands
containing the covalently attached MDC fluoresced brightly.
This provides direct evidence that the measured increase in
fluorescence in the reactions was due to Factor XIIIa
crosslinking of the protein polymer with the MDC fluorescent
marker. These data confirm that the protein polymers PPAS1-G
and F are indeed substrates for Factor XIIIa and can be
used in crosslinking reactions with suitable amine donors.

Pre-polymerization of Protein Polymers

Lyophilized PPAS 1-F and PPAS 1-G proteins were
solubilized for polymerization as follows. Aliquots of 20
mg of each protein were weighed out, dispensed into 1.5 ml
Eppendorf tubes and dissolved in 200 μl of 88% formic acid.
The solutions were loaded with a syringe into Pierce
Slide-a-lyzers (Pierce Cat. #66425) with a 10K molecular
weight cut off. The samples were dialyzed at 22°C for 24
hours versus 4 liters of 100 mM Tris-HCl pH 7.5; 30 mM NaCl;
1 mM EDTA. Upon completion of the dialysis the samples were
removed from the Slide-a-lyzers, microfuged to remove any
particulates, and analyzed by Lowry assay to determine the
polymer concentration remaining in solution. The
concentration of PPAS 1-F was 9.6 mg/ml while that of PPAS
1-G was 16.2 mg/ml.

The polymers were prepared for crosslinking by
aliquoting 20 μl of either PPAS 1-F or PPAS 1-G into 0.5 ml
Eppendorf tubes. To these tubes 5 μl of activated Factor
XIII (prepared by incubating 10 μl Factor XIII with: 110 μl
100 mM Tris-HCl pH 7.5, 30 mM NaCl, 1 mM EDTA; 2.2 μl 100 mM
DTT; 4 μl Thrombin (1.0 EU/ml) (Calbiochem Cat. #605195) at
37°C for one hour) was added and the reaction was incubated
at 37°C for 24 hours. Samples of the crosslinking reaction
were boiled in protein loading buffer for 5 minutes and
loaded on a 4-12% gradient SDS-PAGE gel. After separation of
the protein bands, the gel was electroblotted onto a
nitrocellulose filter and a Western blot was performed using
anti-CLP antibody for PPAS 1-F and anti-SLP antibody for
PPAS 1-G. The results showed a stepwise polymerization of
each polymer, forming a ladder of discrete bands which
corresponded in size to multimers of the unit polymer
molecular weight. PPAS 1-F multimerized to four polymers in
length with a predicted molecular weight of 293 Kd, while
PPAS 1-G showed banding to sixteen times the unit polymer in
length, indicating a molecular weight of 1119 Kd.
Additionally, the PPAS 1-G lane had immunoreactive material
which migrated at the interface of the stacking and
resolving gels as well as material which was trapped in the
well, indicating the presence of even larger molecular weight
products. Finally, a reaction containing equal weights of
both PPAS 1-F and PPAS 1-G polymers resulted in banding to
nine times the unit polymer size with the products reactive
to both antibodies. This indicates the ability of the
polymers to cross-link to non-self as well as identical
molecules.
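
The band positions expected in such a ladder are simple multiples of the unit molecular weight. The following is a minimal Python sketch, assuming the unit weights stated with the PPAS1-F and PPAS1-G sequence listings (72,437 and 69,941) and ignoring the small mass lost at each isopeptide bond. The names `UNIT_MW` and `multimer_mw` are illustrative.

```python
# Unit molecular weights (daltons) as stated with the sequence listings.
UNIT_MW = {"PPAS 1-F": 72437, "PPAS 1-G": 69941}


def multimer_mw(polymer, n_units):
    """Nominal mass of n linked polymer units, ignoring the ammonia
    released for each isopeptide bond formed."""
    return UNIT_MW[polymer] * n_units


# Largest discrete band reported for each polymer in the Western blots.
for polymer, top_band in (("PPAS 1-F", 4), ("PPAS 1-G", 16)):
    ladder = [multimer_mw(polymer, n) for n in range(1, top_band + 1)]
    print(polymer, [round(mw / 1000) for mw in ladder])
```

On this nominal basis the sixteen-unit PPAS 1-G band reproduces the figure quoted in the text, while the four-unit PPAS 1-F band comes out a few kD below the quoted value.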

Rat Skin Lap-Shear Adhesion Assay with PPAS1-G

PPAS1-G was selected for testing in the lap pull assay.
The protein sample was prepared for testing as described in
the solubilization protocol for pre-polymerization. After
dialysis versus water, the PPAS 1-G was concentrated in a
Speed-vac under vacuum until a volume was reached which
corresponded to a polymer concentration of 100 mg/ml. The
polymer was a clear, viscous solution at this point. The
polymer was pre-polymerized with Factor XIIIa to maximize
the size of the individual protein species. This was
accomplished by incubating 90 μl 100 mg/ml PPAS 1-G in 100 mM
HEPES pH 7.5, 30 mM NaCl, 50 mM CaCl2, 2 mM DTT, in a final
volume of 300 μl which included 30 μl of Factor XIII and 15
μl Thrombin (1 EU/ml) (Calbiochem Cat. #605195) for 20 hours
at 37°C. After this initial cross-linking step the reaction
mixture was concentrated threefold in a Speed-vac to a
volume of 100 μl and a polymer concentration of 90 mg/ml.
To this was added 1.5 μl 100 mM DTT, 18.5 μl H2O, and 30 μl
Factor XIII, bringing the volume to 150 μl. Three 50 μl
aliquots were dispensed into three 500 μl Eppendorf tubes
containing 3 μl of Thrombin, mixed well and applied between
a 1 cm square overlap of rat skin. The skins were weighted
to 100 g each and allowed to cure at 37°C for 2 hours. The
skins were then mounted on a mechanical testing apparatus
and a continually increasing force was applied in tension
until the joint between the two skins failed. The average
weight required to cause failure of the three replicate
pulls was 92.9 g/cm², which is significantly greater than a
control of 22.8 g/cm² which consisted of PPAS 1-G without
Factor XIII.
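
The volumes and concentrations in this protocol can be followed with a short mass balance. The following is a minimal Python sketch of that arithmetic, assuming no polymer is lost during concentration or transfer. The function and variable names are illustrative.

```python
def conc_mg_per_ml(mass_mg, volume_ul):
    """Concentration in mg/ml for a mass in mg dissolved in a volume in µl."""
    return mass_mg * 1000 / volume_ul


stock_ul, stock_mg_per_ml = 90, 100
polymer_mg = stock_ul * stock_mg_per_ml / 1000     # polymer carried through

first_step = conc_mg_per_ml(polymer_mg, 300)       # pre-polymerization mix
concentrated = conc_mg_per_ml(polymer_mg, 100)     # after Speed-vac
final_ul = 100 + 1.5 + 18.5 + 30                   # DTT, water, Factor XIII added
applied = conc_mg_per_ml(polymer_mg, final_ul)     # mixture before aliquoting
per_joint_mg = polymer_mg * 50 / final_ul          # polymer in each 50 µl aliquot

print(first_step, concentrated, final_ul, applied, per_joint_mg)
```

The first two concentrations correspond to the threefold concentration step described in the text; the last two values describe the mixture actually applied to each skin overlap under the no-loss assumption.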
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EXAMPLE 7 

Construction of SELP8K and SELP8E 

Polymers designated SELP8K and SELP8E were prepared,
which are characterized by having functional groups for
linking side chains containing reactive amino acids for
transglutaminase cross-linking. The construction of these
polymers is described below starting from the previous gene
monomer, SELP0 (see U.S. Patent #5,243,038, pSY1298).

SELP8K (SEQ ID NO:56) and SELP8E (SEQ ID NO:57) amino acid
monomer sequence design:

SELP8K MONOMER (GAGAGS)4 (GVGVP)4 GKGVP (GVGVP)3

SELP8E MONOMER (GAGAGS)4 (GVGVP)4 GEGVP (GVGVP)3

SELP8 construction 

Plasmid pSY1378 (see U.S. Patent #5,243,038) was digested
with BanI REN, purified using agarose gel electrophoresis
followed by NACS column, and the DNA was then ethanol
precipitated in 2.5 M ammonium acetate and ligated with
pPT0134 (see PCT/US92/09485) previously digested with FokI
REN, phenol/chloroform extracted and ethanol precipitated.

The products of the ligation mixture were transformed
into E. coli strain HB101. Plasmid DNA from transformants was
purified and analyzed by digestion using NruI and XmnI RENs.
Plasmid pPT0255 containing the desired restriction pattern
was obtained and was used for subsequent constructions.

Plasmid DNA pPT0255 was treated with Cfr10I REN followed
by RNAse. The digestion fragments were separated by agarose
gel electrophoresis; the DNA was excised and self-ligated.
The products of the ligation mixture were transformed into
E. coli strain HB101. Plasmid DNA from transformants was
purified and analyzed by digestion using NaeI and StuI RENs.
Plasmid pPT0267 containing the desired deletion was used for
subsequent constructions.

Two oligonucleotide strands as shown in Table 13 were
synthesized and purified as described in Example 1.

Table 13

5'- CTGGAGCGGGTGCCTGCATGTACATCCGAGT -3' (SEQ ID NO: 58)

3'- CCGAGACCTCGCCCACGGACCTACATGTAGGCTCA -5' (SEQ ID NO: 59)

The two oligonucleotide strands were annealed and ligated
with the DNA of plasmid pPT0267 which had been previously
digested with BanII and ScaI RENs, and purified by agarose
gel electrophoresis followed by NACS column.

The products of this ligation reaction were transformed
into E. coli strain HB101. Plasmid DNA from transformants
was purified and digested with DraI. Plasmid DNA from two
clones that gave the correct digestion pattern was
sequenced. One plasmid DNA, designated pPT0287, was found to
be correct and chosen for further constructions.

Plasmid DNA pSY1298 (see U.S. Patent #5,243,038) was
digested with BanII REN, and the SELP0 gene fragment was
purified by agarose gel electrophoresis followed by NACS and
then ligated to pPT0287 digested with BanII. The enzyme was
then removed using phenol/chloroform extraction and ethanol
precipitation.

The products of the ligation mixture were transformed
into E. coli strain HB101. Plasmid DNA from transformants
was purified and analyzed by digestion using DraI REN.
Plasmid DNA from the clones showing the correct restriction
pattern was further digested with BanII, AhaII and StuI
RENs. Plasmid pPT0289 contained the desired SELP8 monomer
sequence (see Table 14).

Table 14: SELP8 Gene Monomer Sequence (SEQ ID NO: 60)

BanI                          BanII
GGT GCC GGT TCT GGA GCT GGC GCG GGC TCT GGA GTA GGT GTG CCA GGT
CCA CGG CCA AGA CCT CGA CCG CGC CCG AGA CCT CAT CCA CAC GGT CCA
 G   A   G   S   G   A   G   A   G   S   G   V   G   V   P   G

GTA GGA GTT CCG GGT GTA GGC GTT CCG GGA GTT GGT GTA CCT GGA GTG
CAT CCT CAA GGC CCA CAT CCG CAA GGC CCT CAA CCA CAT GGA CCT CAC
 V   G   V   P   G   V   G   V   P   G   V   G   V   P   G   V

                            SmaI
GGT GTT CCA GGC GTA GGT GTG CCC GGG GTA GGA GTA CCA GGG GTA GGC
CCA CAA GGT CCG CAT CCA CAC GGG CCC CAT CCT CAT GGT CCC CAT CCG
 G   V   P   G   V   G   V   P   G   V   G   V   P   G   V   G

                                              BanII
GTC CCT GGA GCG GGT GCT GGT AGC GGC GCA GGC GCG GGC TCT GGA GCG
CAG GGA CCT CGC CCA CGA CCA TCG CCG CGT CCG CGC CCG AGA CCT CGC
 V   P   G   A   G   A   G   S   G   A   G   A   G   S   G   A

(SEQ ID NO: 61)

Construction of SELP8K and SELP8E Gene Monomers

One oligonucleotide strand coding for a portion of the
SELP8 gene monomer was synthesized with a single base
polymorphism at position 90. The use of both adenine and
guanine at this position produced oligonucleotides from a
single synthesis that encoded the amino acids lysine and
glutamic acid (see Table 15). The synthesis was conducted
using an Applied Biosystems DNA synthesizer model 381A and
a 2000Å synthesis column supplied by Glen Research. During
the synthesis the required interrupt-pauses for bottle
changes were minimized. After the synthesis the 202 base
DNA fragment was deprotected and cleaved from the column
support by treatment in 30% ammonium hydroxide at 55°C for
6 hrs.

TABLE 15 (SEQ ID NO: 62)

5'-
ATGGCAGCGAAAGGGGACCGGGCTCTGGTGTTGGAGTGCCAGGTGTCGGTGTTCCGGGTGTAGGCGTTC
CGGGAGTTGGTGTACCTGGA(A/G)AAGGTGTTCCGGGGGTAGGTGTGCCGGGCGTTGGAGTACCAGGT
GTAGGCGTCCCGGGAGCGGGTGCTGGTAGCGGCGCAGGCGCGGGCTCTTTCCGCTAAAGTCCTGCCGT-3'

Two additional DNA strands were used as primers for PCR
amplification. The two strands were:

1. 5'-AAGAAGGAGATATCATATGGCAGCCAAAGGGGACC-3' (SEQ ID NO: 63)
2. 5'-CGCAGATCTTTAAATTACGGCAGGACTTTAGCGGAAA-3' (SEQ ID NO: 64)

The PCR reaction was carried out and the reaction
product was purified as described in Example 1.

The DNA was resuspended and digested with BanII REN as
described in Example 1. The digested DNA was then separated
by low-melting agarose gel electrophoresis and ligated with
pPT0289 previously digested with BanII REN and purified by
NACS column. The products of the ligation reaction were
transformed into E. coli strain HB101. Plasmid DNA from
isolated transformants was purified and analyzed by
digestion using ApaLI and EcoNI RENs. Plasmid DNA from the
clones showing the correct restriction pattern was further
analyzed by digestion using Asp700 REN to distinguish
between clones encoding a lysine or glutamic acid at the
polymorphic position. Plasmid DNA from clones containing
each of the polymorphs was purified and analyzed by DNA
sequencing. Plasmid pPT0340 contained the desired SELP8K
monomer sequence and pPT0350 contained the desired SELP8E
monomer sequence (see Tables 16 and 17).
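
The logic of the mixed-base synthesis and the Asp700 screen can be illustrated on the short stretch of sequence around the polymorphic position. The following is a minimal Python sketch. It assumes the flanking sequence as read from Table 15 and assumes the Asp700 recognition sequence GAANNNNTTC (that of its isoschizomer XmnI). `WINDOW`, `resolve` and the other names are illustrative.

```python
import re

# Sequence flanking the mixed position of the Table 15 oligonucleotide;
# "R" marks the base synthesized as a mixture of A and G.
WINDOW = "CCTGGARAAGGTGTTCCGGGG"
CODON_START = 6                          # index of R, first base of the codon

GENETIC_CODE = {"AAA": "K", "GAA": "E"}  # only the two codons needed here

# Assumed Asp700 recognition sequence: GAA, any four bases, TTC.
ASP700 = re.compile("GAA[ACGT]{4}TTC")


def resolve(base):
    """Return the window with the mixed position fixed to one base."""
    return WINDOW.replace("R", base)


variants = {}
for base in "AG":
    seq = resolve(base)
    codon = seq[CODON_START:CODON_START + 3]
    # Map the encoded residue to whether an Asp700 site is present.
    variants[GENETIC_CODE[codon]] = bool(ASP700.search(seq))

print(variants)
```

Under these assumptions only the glutamic acid variant creates an Asp700 site, which is why a single digest is enough to tell SELP8K clones from SELP8E clones.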

Table 16: SELP8K Gene Monomer Sequence (SEQ ID NO: 65)

BanI                          BanII
GGT GCC GGT TCT GGA GCT GGC GCG GGC TCT GGT GTT GGA GTG CCA GGT
CCA CGG CCA AGA CCT CGA CCG CGC CCG AGA CCA CAA CCT CAC GGT CCA
 G   A   G   S   G   A   G   A   G   S   G   V   G   V   P   G

                                                    EcoNI
GTC GGT GTT CCG GGT GTA GGC GTT CCG GGA GTT GGT GTA CCT GGA AAA
CAG CCA CAA GGC CCA CAT CCG CAA GGC CCT CAA CCA CAT GGA CCT TTT
 V   G   V   P   G   V   G   V   P   G   V   G   V   P   G   K

GGT GTT CCG GGG GTA GGT GTG CCG GGC GTT GGA GTA CCA GGT GTA GGC
CCA CAA GGC CCC CAT CCA CAC GGC CCG CAA CCT CAT GGT CCA CAT CCG
 G   V   P   G   V   G   V   P   G   V   G   V   P   G   V   G

  SmaI                                        BanII
GTC CCG GGA GCG GGT GCT GGT AGC GGC GCA GGC GCG GGC TCT GGA GCG
CAG GGC CCT CGC CCA CGA CCA TCG CCG CGT CCG CGC CCG AGA CCT CGC
 V   P   G   A   G   A   G   S   G   A   G   A   G   S   G   A

(SEQ ID NO: 66)



Table 17: SELP8E Gene Monomer Sequence (SEQ ID NO: 67)

BanI                          BanII
GGT GCC GGT TCT GGA GCT GGC GCG GGC TCT GGT GTT GGA GTG CCA GGT
CCA CGG CCA AGA CCT CGA CCG CGC CCG AGA CCA CAA CCT CAC GGT CCA
 G   A   G   S   G   A   G   A   G   S   G   V   G   V   P   G

EcoNI
GTC GGT GTT CCG GGT GTA GGC GTT CCG GGA GTT GGT GTA CCT GGA GAA
CAG CCA CAA GGC CCA CAT CCG CAA GGC CCT CAA CCA CAT GGA CCT CTT
 V   G   V   P   G   V   G   V   P   G   V   G   V   P   G   E

Asp700
GGT GTT CCG GGG GTA GGT GTG CCG GGC GTT GGA GTA CCA GGT GTA GGC
CCA CAA GGC CCC CAT CCA CAC GGC CCG CAA CCT CAT GGT CCA CAT CCG
 G   V   P   G   V   G   V   P   G   V   G   V   P   G   V   G

SmaI                                          BanII
GTC CCG GGA GCG GGT GCT GGT AGC GGC GCA GGC GCG GGC TCT GGA GCG
CAG GGC CCT CGC CCA CGA CCA TCG CCG CGT CCG CGC CCG AGA CCT CGC
 V   P   G   A   G   A   G   S   G   A   G   A   G   S   G   A

(SEQ ID NO: 68)
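
The Asp700 screen described above relies on the single codon that differs between the two monomers. The following Python sketch is an illustration only and not part of the original disclosure: it transcribes the top strands of Tables 16 and 17 as read from the tables, assumes the Asp700 recognition sequence GAANNNNTTC, and uses hypothetical helper names (`translate`, `has_asp700_site`).

```python
import re

# Top strand of the SELP8K monomer, transcribed codon by codon from Table 16.
SELP8K_MONOMER = "".join("""
GGT GCC GGT TCT GGA GCT GGC GCG GGC TCT GGT GTT GGA GTG CCA GGT
GTC GGT GTT CCG GGT GTA GGC GTT CCG GGA GTT GGT GTA CCT GGA AAA
GGT GTT CCG GGG GTA GGT GTG CCG GGC GTT GGA GTA CCA GGT GTA GGC
GTC CCG GGA GCG GGT GCT GGT AGC GGC GCA GGC GCG GGC TCT GGA GCG
""".split())

# SELP8E differs only at the polymorphic codon (AAA Lys -> GAA Glu).
SELP8E_MONOMER = SELP8K_MONOMER.replace("GGAAAAGGT", "GGAGAAGGT", 1)

# Assumed recognition sequence for Asp700: GAANNNNTTC.
ASP700_SITE = re.compile(r"GAA[ACGT]{4}TTC")

# Only the codons that occur in the two monomers are listed.
CODON_TABLE = {
    "GGT": "G", "GGA": "G", "GGC": "G", "GGG": "G",
    "GCC": "A", "GCT": "A", "GCG": "A", "GCA": "A",
    "GTT": "V", "GTG": "V", "GTC": "V", "GTA": "V",
    "CCA": "P", "CCG": "P", "CCT": "P",
    "TCT": "S", "AGC": "S", "AAA": "K", "GAA": "E",
}


def translate(dna):
    """Translate an in-frame DNA string using the reduced codon table."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def has_asp700_site(dna):
    """Return True if the sequence contains a GAANNNNTTC site."""
    return ASP700_SITE.search(dna) is not None


for name, dna in (("SELP8K", SELP8K_MONOMER), ("SELP8E", SELP8E_MONOMER)):
    print(name, "Asp700 site:", has_asp700_site(dna),
          "residue 32:", translate(dna)[31])
```

Under these assumptions only the glutamic acid variant carries the site, which is consistent with using Asp700 digestion to tell the two polymorphs apart.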

SELP8K Polymer Construction

Plasmid DNA from pPT0340 was digested with BanI REN
and the digestion fragments were separated by agarose gel
electrophoresis. The SELP8K gene fragment, 192 bp, was
excised and purified by NACS column. The purified fragment
was ligated with plasmid pPT0317, which had been digested
with BanI REN and passed through a Millipore Probind and a
Bio-Spin 6 column. The DNA was then treated with shrimp
alkaline phosphatase (SAP) as described in Example 1.

The products of this ligation reaction were transformed
into E. coli strain HB101. Transformants were selected for
resistance to kanamycin. Plasmid DNA from individual
transformants was purified and analyzed for increased size
due to insertion of multiple SELP8K monomer DNA units.
Several clones were obtained with insert sizes ranging from
200 bp to approximately 7 kb. Clones containing from 6 to 32
repeats were used for expression of the SELP8K protein
polymer (pPT0341, pPT0343, pPT0344, pPT0345 and pPT0347).

SELP8K Expression Analysis

An overnight culture which had been grown at 30°C was
used to inoculate 50 ml of LB media contained in a 250 ml
flask. Kanamycin was added at a final concentration of 50
µg per ml and the culture was incubated with agitation (200
rpm) at 30°C. When the culture reached an OD600 of 0.8, 40
ml were transferred to a new flask prewarmed at 42°C and
incubated at the same temperature for approximately 2 hours.
The cultures (30°C and 42°C) were chilled on ice and the
OD600 was taken. Cells were collected by centrifugation,
divided into 1.0 OD600 aliquots, and used to perform Western
analysis using anti-SLP antibody.

E. coli strain HB101 containing plasmids pPT0341,
pPT0343, pPT0344, pPT0345 and pPT0347 was grown as
described above. The proteins produced by these cells were
analyzed by Western blot for detection of proteins reactive
to SLP antibodies. Each clone produced a strongly reactive
band. The apparent molecular weights of the products ranged
from approximately 35 kD to greater than 250 kD. Strain
pPT0345 produced an SLP antibody reactive band of apparent
molecular weight 80,000. The expected amino acid sequence
of the SELP8K polymer encoded by plasmid pPT0345 is shown
below.

pPT0345 (SEQ ID NO: 69) SELP8K 884 AA MW 69,772

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGSGAGAGS
[(GVGVP)4 GKGVP (GVGVP)3 (GAGAGS)4]12
(GVGVP)4 GKGVP (GVGVP)3 (GAGAGS)2
GAGAMDPGRYQDLRSHHHHHH
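
The block notation for the pPT0345 product can be expanded into a full sequence to check it against the stated length of 884 amino acids. The sketch below is illustrative only. It assumes the leader reads MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGSGAGAGS, that the bracketed block repeats 12 times, and that the final block ends with two GAGAGS units. The function names `block` and `selp8k` are hypothetical.

```python
from collections import Counter

# Leader and tail as read from the pPT0345 sequence (assumed reading).
HEAD = "MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGSGAGAGS"
TAIL = "GAGAMDPGRYQDLRSHHHHHH"


def block(silk_repeats):
    """One elastin-like/silk-like unit: (GVGVP)4 GKGVP (GVGVP)3 (GAGAGS)k."""
    return "GVGVP" * 4 + "GKGVP" + "GVGVP" * 3 + "GAGAGS" * silk_repeats


def selp8k(n_repeats=12):
    """Expand the block formula into a single-letter amino acid string."""
    return HEAD + block(4) * n_repeats + block(2) + TAIL


sequence = selp8k()
composition = Counter(sequence)

print("length:", len(sequence))
for residue in "GVAPSK":
    count = composition[residue]
    print(residue, count, f"{100 * count / len(sequence):.1f}%")
```

With these assumptions the expansion contains 13 lysines, one per GKGVP unit, which is the number of amine-bearing side chains the specification later attributes to the polymer.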

SELP8K Purification

SELP8K was produced in E. coli strain pPT0345 by
fermentation. The product was purified from the cellular
biomass by means of cellular lysis, clearance of insoluble
debris by centrifugation, and affinity chromatography. The
purified product was analyzed by sodium dodecyl sulfate
polyacrylamide gel electrophoresis (SDS-PAGE),
immunoreactivity with a polyclonal antiserum which reacts
with silk-like peptide blocks (SLP antibody), and amino acid
analysis. A protein band of apparent molecular weight 80,000
was observed by amido black staining of SDS-PAGE separated
and transferred samples, and the same band reacted with the
SLP antibody on Western blots. As expected, amino acid
analysis (shown in Table 18) indicated that the product was
enriched for the amino acids glycine (43.7%), alanine
(12.3%), serine (5.3%), proline (11.7%), and valine (21.2%).
The product also contained 1.5% lysine. The amino acid
composition table below shows the correlation between the
composition of the purified product and the expected
theoretical composition as deduced from the synthetic gene
sequence.



TABLE 18: Amino Acid Analysis of Purified SELP8K

Amino    ACTUAL      ACTUAL           THEORETICAL
Acid     pmoles      % composition    % composition

Ala      1623.14     12.3             12.2
Asx      122.20      0.9              0.8
Glx      nd          nd               0.4
Phe      58.16       0.4              0.1
Gly      5759.31     43.7             41.5
His      46.75       0.4              0.8
Ile      43.87       0.3              0
Lys      198.21      1.5              1.5
Leu      39.54       0.3              0.5
Met      36.01       0.3              0.3
Pro      1534.21     11.7             12.4
Arg      70.84       0.5              0.6
Ser      703.83      5.3              6.1
Thr      nd          nd               0.1
Val      2797.47     21.2             22.4
Tyr      140.87      1.1              0.1

nd = none detected
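
The percent composition column of Table 18 follows from the picomole values by dividing each entry by the total recovered. The Python sketch below is a minimal illustration of that arithmetic, assuming the picomole values as transcribed from the table and treating "nd" entries as absent. Recomputed values may differ from the tabulated ones in the last digit.

```python
# Picomoles recovered per amino acid, as transcribed from Table 18.
# Glx and Thr were not detected and are left out of the total.
PMOLES = {
    "Ala": 1623.14, "Asx": 122.20, "Phe": 58.16, "Gly": 5759.31,
    "His": 46.75, "Ile": 43.87, "Lys": 198.21, "Leu": 39.54,
    "Met": 36.01, "Pro": 1534.21, "Arg": 70.84, "Ser": 703.83,
    "Val": 2797.47, "Tyr": 140.87,
}

total_pmoles = sum(PMOLES.values())

# Percent composition = 100 * pmoles of residue / total pmoles.
percent = {aa: 100 * value / total_pmoles for aa, value in PMOLES.items()}

for aa, value in sorted(percent.items(), key=lambda item: -item[1]):
    print(f"{aa}: {value:.1f}%")
```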

Construction of Peptide/Protein Polymer Adhesive Conjugates

The construction of peptide/protein polymer conjugates
that would function as substrates for Factor XIII
crosslinking began with a screen of potential peptide
sequences with known activities for Factor XIII. Several
peptide sequences were tested before settling on a sequence,
K9, first disclosed by Fickenscher et al. (Thromb. Haemostas.
1991, 65, p. 535-540). Peptides were purified using reverse
phase HPLC and analyzed by mass spectrometry.

Pep94.5 (SEQ ID NO: 70) LGPGQSKVIG-NH2 PPTpep0015

This sequence was further modified to enable conjugation and
characterization of the degree of substitution of the final
conjugate.

Pep94.14 (SEQ ID NO: 71) Ac-C(nle)LGPGQSKVIG-NH2 PPTpep0017

The cysteine was added to functionalize the peptide for
conjugation. The norleucine (nle) was added as a label to
determine the number of peptides in the final conjugate.
The glutamine in this peptide is the site for Factor
XIII-mediated crosslinking with an amine donor. A review of
the literature disclosed lysine-containing peptide sequences
which were found to be especially active amine donors for
Factor XIII (J.E. Folk (1983) Adv. Enzymol. 54, p. 1-56).

Pep94.11 (SEQ ID NO: 72) GGLKGGG PPTpep0020

As with Pep94.5, this sequence was modified to enable
conjugation and characterization.

Pep94.13 (SEQ ID NO: 73) C(nle)GGLKGGG PPTpep0021

25 

Table 19: Mass Spectrometry of Peptides Synthesized

            Theoretical         Mass Spec
Peptide     Molecular Weight    Molecular Weight

P94.5       954.11              953.7
P94.14      1212.4              1212.9
P94.11      544.59              545.8
P94.13      760.89              762.2
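
The theoretical molecular weights in Table 19 can be approximated by summing average residue masses and adding the terminal groups. The sketch below is an illustration under stated assumptions: it uses standard average residue masses, treats norleucine as isobaric with leucine, and models the Ac- and -NH2 caps as fixed mass increments. The function name `peptide_mass` is hypothetical, and the results agree with the tabulated values only to within about 0.1 Da.

```python
# Average residue masses (Da); "Nle" (norleucine) is an isomer of leucine.
RESIDUE_MASS = {
    "G": 57.0519, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "C": 103.1388, "L": 113.1594, "I": 113.1594, "Nle": 113.1594,
    "Q": 128.1307, "K": 128.1741,
}
WATER = 18.0153      # free N-terminal H plus C-terminal OH
ACETYL = 42.0367     # N-terminal acetyl cap (adds C2H2O)
AMIDATION = -0.9847  # C-terminal OH replaced by NH2


def peptide_mass(residues, acetylated=False, amidated=False):
    """Average molecular weight of a linear peptide from its residue list."""
    mass = sum(RESIDUE_MASS[r] for r in residues) + WATER
    if acetylated:
        mass += ACETYL
    if amidated:
        mass += AMIDATION
    return mass


PEPTIDES = {
    "P94.5": peptide_mass(list("LGPGQSKVIG"), amidated=True),
    "P94.14": peptide_mass(["C", "Nle", *"LGPGQSKVIG"],
                           acetylated=True, amidated=True),
    "P94.11": peptide_mass(list("GGLKGGG")),
    "P94.13": peptide_mass(["C", "Nle", *"GGLKGGG"]),
}

for name, mass in PEPTIDES.items():
    print(f"{name}: {mass:.2f}")
```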
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These peptides were conjugated to the protein polymer, 
SELP8K, which was specifically designed to be
functionalized with different reagents reactive to amines.
A bifunctional crosslinker, Sulfo-GMBS
(N-(γ-maleimidobutyryloxy)sulfosuccinimide ester), purchased
from Pierce Chemical Company (catalog # 22324), was used to
graft the peptides to the SELP8K protein polymer.

C95.1, the polymer/peptide conjugate containing the lysine
donor peptide [SELP8K P94.13 (PPTpep0021)], was synthesized
as follows. Prior to the reaction, two 10 ml Presto columns
(Pierce Chemical Company) were each equilibrated with 100
ml of 20 mM NaPO4 (pH 7.0), 100 mM NaCl. 70 mg of purified
SELP8K was brought up to 6 ml with 20 mM NaPO4 (pH 7.0),
100 mM NaCl. The solution was vortexed. 50 mg of Sulfo-GMBS
was added slowly to the polymer solution while
stirring. A yellow color developed. The reaction was
allowed to proceed while stirring for 30 min at room
temperature. Half of the reaction was chromatographed over
each Presto column and eluted with 20 mM NaPO4 (pH 7.0), 100
mM NaCl. 1 ml fractions were collected. Fractions
containing the derivatized polymer were pooled.

20 mg of Pep94.13 was added slowly to the above pooled
fractions. The mixture was stirred for 16 hours at room
temperature. 100 µl of β-mercaptoethanol was added to stop
the reaction. The reaction was dialyzed in a 3500 molecular
weight cut-off dialysis bag against three changes of four
liters of deionized water. The dialyzed solution was
lyophilized.

C95.2, the polymer/peptide conjugate containing the
glutamine donor peptide [SELP8K Pep94.14 (PPTpep0017)], was
synthesized exactly as C95.1 except that 25 mg of Pep94.14
was added to the Sulfo-GMBS derivatized SELP8K polymer.

Table 20: Amino Acid Analysis of C95.1

Name     Actual AA     P94.13           C95.1-P94.13    SELP8K           Peptides per
         composition   Theoretical AA   Calculated AA   Theoretical AA   Conjugate
         (pmoles)      composition      composition     composition
                       (# of residues)  (pmoles)        (# of residues)

ASX      41.00                          41.00           7
GLX      40.00                          40.00           4
SER      375.00                         375.00          54
GLY      2826.00       5                2336.00         367              15.40
HIS      36.00                          36.00           7
ARG      25.00                          25.00           5
THR      0.00                           0.00            1
ALA      778.00                         778.00          108
PRO      694.00                         694.00          110
TYR      10.00                          10.00           1
VAL      1237.00                        1237.00         198
MET      26.00                          26.00           3
ILE      0.00                           0.00            0
LEU      113.00        1                15.00           4
NLE      98.00         1                0.00            0
PHE      14.00                          14.00           1
LYS      189.00        1                91.00           13               14.00

                                                        Average          14.70
                                                        Stdev            0.99

This analysis indicates that conjugate C95.1 contains an
average of 14.7 peptides per SELP8K polymer molecule. This
is greater than the number of lysines present in the polymer
(13).
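
The peptides-per-conjugate figures in Table 20 can be reproduced from the norleucine label. The sketch below is an illustrative reading of the table's arithmetic and not a method stated in the specification: it takes the norleucine picomoles as the amount of peptide, subtracts the peptide's share of a residue from the measured total, and divides the remainder by the number of that residue in SELP8K to estimate the picomoles of polymer. The function name is hypothetical.

```python
from statistics import mean, stdev

NLE_PMOL = 98.0  # norleucine recovered; one norleucine per grafted peptide


def peptides_per_conjugate(actual_pmol, per_peptide, per_polymer,
                           nle_pmol=NLE_PMOL):
    """Estimate the number of peptides grafted per polymer molecule.

    actual_pmol: residue picomoles measured in the conjugate
    per_peptide: copies of the residue in one peptide
    per_polymer: copies of the residue in one SELP8K molecule
    """
    polymer_share = actual_pmol - per_peptide * nle_pmol
    polymer_pmol = polymer_share / per_polymer
    return nle_pmol / polymer_pmol


# Glycine and lysine rows of Table 20 (P94.13 contains 5 Gly and 1 Lys).
estimates = {
    "GLY": peptides_per_conjugate(2826.0, per_peptide=5, per_polymer=367),
    "LYS": peptides_per_conjugate(189.0, per_peptide=1, per_polymer=13),
}

average = mean(estimates.values())
spread = stdev(estimates.values())  # sample standard deviation

for name, value in estimates.items():
    print(f"{name}: {value:.2f}")
print(f"average {average:.2f}, stdev {spread:.2f}")
```

Residues that are abundant in the polymer and also present in the peptide give the most stable estimates, which may be why only some rows of the table carry a value in the last column.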



Table 21: Amino Acid Compositional Analysis of C95.2

Name     C95.2         P94.14           C95.2-P94.14    SELP8K           Peptides per
         (pmoles)      Theoretical AA   Calculated AA   Theoretical AA   Conjugate
                       Composition      Composition     Composition
                       (# of residues)  (pmoles)        (# of residues)

ASX      29.00                          29.00           7
GLX      89.00         1                28.00           4
SER      392.00        1                331.00          54               9.95
GLY      2112.00       3                1929.00         367              11.61
HIS      27.00                          27.00           7
ARG      24.00                          24.00           5
THR      0.00                           0.00            1
ALA      636.00                         636.00          108
PRO      638.00        1                577.00          110              11.63
TYR      8.00                           8.00            1
VAL      1063.00       1                1002.00         198              12.05
MET      8.00                           8.00            3
ILE      52.00         1                -9.00           0
LEU      [illegible]   1                17.00           4
NLE      61.00         1                0.00            0
PHE      12.00                          12.00           1
LYS      140.00        1                79.00           13               10.04

                                                        Average          11.06
                                                        Stdev            0.99

This analysis indicates that conjugate C95.2 contains an
average of 11.1 peptides per SELP8K polymer molecule.

The two conjugates C95.1 and C95.2 were used in an
adhesive formulation to test their ability to bond skin
together. The formulation was produced by placing 5.7 mg of
each conjugate in a 1.5 ml Eppendorf tube. 58 µl of
deionized water was added and the tube vortexed for 5 min.
5.8 µl of 1 M HEPES (pH 7.5), 300 mM NaCl; 6 µl of 100 mM
dithiothreitol; 26.8 µl of Factor XIII; and 10.5 µl of 500
mM CaCl2 were added and the solution vortexed. 35 µl
aliquots were placed in individual tubes.
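
The nominal final concentrations in the formulation follow from the stock concentrations and the volumes added. The sketch below is an illustration only. It assumes all volumes are in microliters, that the HEPES stock also contains the 300 mM NaCl, and that the dissolved conjugates add no volume. The Factor XIII stock concentration is not given in the text, so that component is counted only as volume.

```python
# Volumes added to the tube, in microliters (assumed units).
VOLUMES_UL = {
    "water": 58.0,
    "HEPES/NaCl stock": 5.8,
    "DTT stock": 6.0,
    "Factor XIII": 26.8,
    "CaCl2 stock": 10.5,
}

# Stock concentrations in mM: (solute, source component, concentration).
STOCKS_MM = [
    ("HEPES", "HEPES/NaCl stock", 1000.0),
    ("NaCl", "HEPES/NaCl stock", 300.0),
    ("DTT", "DTT stock", 100.0),
    ("CaCl2", "CaCl2 stock", 500.0),
]

CONJUGATE_MG = 5.7  # mass of each conjugate (C95.1 and C95.2)

total_ul = sum(VOLUMES_UL.values())

# C_final = C_stock * V_stock / V_total
final_mm = {
    solute: conc * VOLUMES_UL[component] / total_ul
    for solute, component, conc in STOCKS_MM
}

# mg per ml for each conjugate, ignoring the volume of the solids.
conjugate_mg_per_ml = CONJUGATE_MG / (total_ul / 1000.0)

print(f"total volume: {total_ul:.1f} ul")
for solute, value in final_mm.items():
    print(f"{solute}: {value:.1f} mM")
print(f"each conjugate: {conjugate_mg_per_ml:.1f} mg/ml")
```

The later addition of 3 µl of thrombin solution to each 35 µl aliquot dilutes these values slightly and is not included here.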

Just prior to application to skins, 3 µl of thrombin, 1
unit/µl (Calbiochem #605195, 100 units brought up to 100 µl
with enzyme dilution buffer: 50% glycerol, 6.67 mM DTT, 10
mM Tris-HCl pH 7.5), was added to each aliquot. The solution
was mixed briefly and applied to a 1 cm² area at one end of
the subcutaneous face of a 1 cm x 3 cm strip of rat skin.
Another skin strip was overlaid such that a lap-over area
of 1 cm² was produced between the two skins. The skins were
wrapped in plastic wrap and placed at 37°C with a weight of
100 grams placed on them. At a specified cure time the skins
were brought to room temperature and promptly pulled apart
to measure the shear strength of the lapped bond.

The solution was fluid before application to skins. It
flowed easily over the skins and wetted the skin surface
thoroughly. After several minutes at 37°C, the material set
as evidenced by the formation of a strong, flexible solid
material. The crosslinked adhesive formulation also has
utility as a firm hydrogel material which can be used for
medical devices such as contact lenses or as a time-release
drug delivery system.

Table 22: Rat Skin Lap-Shear Adhesive Assay

Test                                                    Average Bond Strength
Conditions          Test A      Test B      Test C      (grams/cm²)             Stdev

5 min. cure @ 37°C  190 g       300 g       *           245.0                   77.8
Lap Area            [illegible] 1.2 cm²

30 min. cure @ 37°C 477 g       900 g       1078 g      818.3                   308.7
Lap Area            0.96 cm²    1.2 cm²     1.3 cm²

* Sample gelled in tube before application to skins
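
The average and standard deviation columns of Table 22 can be recomputed from the individual test loads. The sketch below is illustrative and assumes the loads as transcribed from the table. It shows that the tabulated averages correspond to the mean of the loads as recorded, and it also prints area-normalized values for the 30 minute cure, where all three lap areas are legible. That normalization is an added illustration, not a calculation reported in the specification.

```python
from statistics import mean, stdev

# Loads at failure in grams, per cure time, as transcribed from Table 22.
LOADS_G = {
    "5 min": [190.0, 300.0],           # Test C gelled before application
    "30 min": [477.0, 900.0, 1078.0],
}

# Lap areas in cm^2 for the 30 minute cure (Tests A, B, C).
LAP_AREA_30_MIN = [0.96, 1.2, 1.3]

summary = {
    cure: (mean(loads), stdev(loads))  # sample standard deviation
    for cure, loads in LOADS_G.items()
}

for cure, (avg, sd) in summary.items():
    print(f"{cure}: mean {avg:.1f} g, stdev {sd:.1f} g")

# Optional normalization by measured lap area (grams per cm^2).
normalized = [load / area
              for load, area in zip(LOADS_G["30 min"], LAP_AREA_30_MIN)]
print("30 min, per cm^2:", [f"{value:.1f}" for value in normalized])
```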



EXAMPLE 9

Additional PPAS designs incorporating non-fibrin derived
crosslinking sequences.

PPAS2 Constructions

The results obtained from the above examples prove that
sequences from natural proteins which serve as substrates
for enzymatic crosslinking can be incorporated into novel
polymeric protein-based substrates which function in the
same capacity. They can either be conjugated to protein
carriers, thereby allowing their density per molecule to be
controlled, or incorporated within a protein polymer,
allowing additional control over their distribution along
the chain. These results can be expanded to include
sequences or molecules that are modified over natural
sequences by amino acid substitutions or chemical
modifications for purposes of increasing their overall
catalytic activity toward the crosslinking enzyme. The
scientific literature demonstrates that sequence-modified
peptides can result in 2-10 fold improvements in catalytic
activity by either increasing their affinity for the enzyme,
increasing the enzyme's turnover number, or both.

The ability to utilize peptides in the construction of
polymeric protein adhesive substrates that are synthetically
designed or chemically modified to optimize their
crosslinking activity with Factor XIII or other
transglutaminases is further exemplified by the following
examples.

PPAS2-A (SEQ ID NO: 74): Lysine donor polymer dispersed
throughout SELP8 backbone

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGSGAGAGS
[(GVGVP)4 G GGLKGGG GVP (GVGVP)3 (GAGAGS)4]n
(GVGVP)4 G GGLKGGG GVP (GVGVP)3 (GAGAGS)3
GAGAMDPGRYQDLRSHHHHHH

Where n=2-20. Crosslinking sequences shown in bold.

PPAS2-B (SEQ ID NO: 75): Glutamine (K9) donor dispersed
throughout SELP8 backbone

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGSGAGAGS
[(GVGVP)4 G LGPGQSKVIG GVP (GVGVP)3 (GAGAGS)4]n
(GVGVP)4 G LGPGQSKVIG GVP (GVGVP)3 (GAGAGS)3
GAGAMDPGRYQDLRSHHHHHH

Where n=2-20. Crosslinking sequences shown in bold.

PPAS2-C (SEQ ID NO: 76): Lysine donor contained only at ends
of SELP8 backbone

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDAPGTP GGLKGGG GSPGAGSGAGAGS
[(GVGVP)8 (GAGAGS)4]n
(GVGVP)8 (GAGAGS)3
GAGAMDPGRYHMAAKGDRAPGTP GGLKGGG GSPDQDLRSHHHHHH

Where n=2-20. Crosslinking sequences shown in bold.

PPAS2-D (SEQ ID NO: 77): Glutamine (K9) donor contained only
at ends of SELP8 backbone

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDAPGTP LGPGQSKVIG GSPGAGSGAGAGS
[(GVGVP)8 (GAGAGS)4]n
(GVGVP)8 (GAGAGS)2
GAGAMDPGRYHMAAKGDRAPGTP LGPGQSKVIG GSPDQDLRSHHHHHH

Where n=2-20. Crosslinking sequences shown in bold.

PPAS2-E (SEQ ID NO: 78): Glutamine (K9) and Lysine donor
(contained only at ends of SELP8 backbone)

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDAPGTP GGLKGGG GSPGAGSGAGAGS
[(GVGVP)4 G LGPGQSKVIG GVP (GVGVP)3 (GAGAGS)4]n
(GVGVP)4 G LGPGQSKVIG GVP (GVGVP)3 (GAGAGS)2
GAGAMDPGRYHMAAKGDRAPGTP GGLKGGG GSPDQDLRSHHHHHH

Where n=2-20. Crosslinking sequences shown in bold.

PPAS2-F (SEQ ID NO: 79): Mixed Glutamine (K9) and Lysine
donor (dispersed throughout SELP8 backbone)

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGSGAGAGS
[(GVGVP)4 G LGPGQSKVIG GVP (GVGVP)3 (GAGAGS)4 (GVGVP)4 G GGLKGGG GVP
(GVGVP)3 (GAGAGS)4]n
(GVGVP)4 G LGPGQSKVIG GVP (GVGVP)3 (GAGAGS)4 (GVGVP)4 G GGLKGGG GVP
(GVGVP)3 (GAGAGS)3
GAGAMDPGRYQDLRSHHHHHH

Where n=2-20. Crosslinking sequences shown in bold.

It is apparent to someone skilled in the art that an
endless number of combinations and arrangements of glutamine
donor, lysine donor, and structural protein blocks can be
made in order to improve the crosslinking of the polymeric
substrates and the properties of the adhesives which they
comprise. For example, adjustments in the stoichiometric
number and ratio of active sites may change the overall
crosslink density and cohesive strength of the crosslinked
protein matrix and change its susceptibility to degradation
when placed in or on the body. Increasing the ratio of
glutamine donor sites to lysine donor sites may improve the
adhesive bond strength by increasing the probability of
polymer crosslinking with tissue proteins containing lysine
donors.

Additional peptide sequences can be used in place of the
sequences shown above. A list of examples is shown below.
Subsequences of the examples shown may also be sufficient
for activity. The sequences shown may also be extended with
amino acids on either end to improve their utility; for
example, multiple glycine tails may be added to improve the
sequence's accessibility to the active site of the enzyme.



Glutamine Donors

GQQHHLGG (SEQ ID NO: 80)
SVLSLSQSKVLPVPE (SEQ ID NO: 81)
SVLSLSQSRVLPVPE (SEQ ID NO: 82)
CVLSLSQSRVLVPEC (SEQ ID NO: 83)
SVLSGSQSKVLPVPE (SEQ ID NO: 84)
CLGPGQSKVIGC (SEQ ID NO: 86)
LSLSQSKVLG (SEQ ID NO: 88)
LGPGQSRVIG (SEQ ID NO: 90)
LGPGQHKVIG (SEQ ID NO: 92)
LGPGQHRVIG (SEQ ID NO: 94)
CLGPGQSRVIGC (SEQ ID NO: 96)
GGPGQSKVIGG (SEQ ID NO: 98)
NQEQVSPLT (SEQ ID NO: 99)
TDMPQMRMQL (SEQ ID NO: 100)
CGQSKVIC (SEQ ID NO: 101)
KVLPIPQQWPY (SEQ ID NO: 102)
RAVPVQALL (SEQ ID NO: 103)
LNQELL (SEQ ID NO: 104)
TVQQEL (SEQ ID NO: 105)
VHHQKLV (SEQ ID NO: 106)

Lysine Donors

XKZ, where X = L, S, A, K, R, and Z = G, V, H, but X ≠ F, G, and Z ≠ P
PXKZ, where X = L, S, A, K, R, and Z = G, V, H, but X ≠ P, G, and Z ≠ P
GPXKZ, where X = L, S, A, K, R, and Z = G, V, H, but X ≠ V, G, and Z ≠ V
PGXKZG, where X = L, S, A, K, R, and Z = G, V, H, but X ≠ V, G, and Z ≠ V
RRSKHIS (SEQ ID NO: 85)
AKKKRS (SEQ ID NO: 87)
AHKKGQ (SEQ ID NO: 89)
YLKDQQ (SEQ ID NO: 91)
PTTKMA (SEQ ID NO: 93)
RLDHKF (SEQ ID NO: 95)
VGSNKGA (SEQ ID NO: 97)



There is also the opportunity to derivatize protein or
peptide substrates with non-peptide, organic amide and amine
compounds, making them substrates for transglutaminase
crosslinking. Such molecules allow for the synthesis of
high molecular weight carrier molecules with multiple
pendant alkane carboxamides and alkane amines. Support for
this comes from the fact that pentyl amine, putrescine and
cadaverine can all participate in transglutaminase
crosslinking reactions with a peptide or protein-derived
glutamine. Derivatizing these compounds with biotin or
dansyl groups on the distal carbon from the amine does not
abolish their reactivity. Therefore, they can be grafted
onto a synthetic polymer, for example, or suitably modified
to be co-polymerized with other monomers while retaining
their crosslinking activity. Similar strategies would apply
to the derivatization of a polymer chain with a pendant
amide that would be utilized as a glutamine donor.

Tissue Adhesives and Sealants

Polymeric materials having multiple sequences which are
capable of covalent cross-linking by enzymatic reaction have
been produced. One use of such compositions is in the
formulation of tissue adhesives and sealants for medical
applications. There are a number of factors to be considered
in both the design and selection of the polymeric
compositions to be used as well as the additional components
comprising the final product in order to optimize its
utility.

As one example of a tissue adhesive or sealant, the
products comprise two principal components which are mixed
prior to or simultaneously with the product's application.
One component contains the factor XIIIa cross-linkable
substrate and the other contains factor XIIIa. Factor XIIIa
can be produced by the reaction of blood-derived factor XIII
and thrombin. Factor XIIIa has been produced by means of
recombinant DNA technology (Board, P.G., et al., Thrombosis
and Haemostasis 63(2): 235-240 (1990)). The use of
recombinant human factor XIIIa in a tissue adhesive or
sealant does not require the use of thrombin.

Using fibrin glue as a model, polymeric compositions
have been described which are designed to incorporate the
adhesive activity of fibrin with the opportunity to modify
its properties through an increased number of available
cross-linking sites to accelerate the set time and increase
the mechanical strength of the final products. However, the
speed and degree at which cross-linking will occur is
limited by one of the same factors that limits the speed of
fibrin cross-linking: the size of factor XIIIa (160 kDa). It
is known that activated factor XIIIa can be digested with
trypsin to yield a 51 kDa single chain fragment which has
fibrin binding and transglutaminase activity (Greenberg, C.
S., et al., Biochem. J. (1988) 256: 1013-1019). Therefore,
to increase the speed and density of cross-linking, it is
desirable to use a truncated human factor XIIIa, which can
be prepared by conventional or recombinant means.

An adhesive matrix of increased density and increased
strength can be achieved by using a mixture of higher and
lower molecular weight polymeric compositions. For example,
where the polymeric compositions are polymers produced by
conventional chemical synthesis (protein or otherwise),
reaction conditions are used to produce polymers with a
range of molecular weights. Where the polymeric compositions
are recombinant protein polymers, clones producing
cross-linkable substrates of discrete molecular weight
ranging from 10-100 kDa (or greater) can be isolated
depending upon the number of monomer sequences which are
inserted in the final gene.

The polymeric compositions may include additional
peptide units to confer improved physical and chemical
adhesion to tissue and better handling and mechanical
properties. These amino acid sequences may be of de novo
design or derived from natural proteins, particularly human
proteins. Sequences providing such functions have been
identified in the literature. By way of example, decreased
setting times may be achieved by inclusion of peptide units
which improve the polymeric composition's cohesivity through
the ability to bind or chelate complexing agents (e.g.
multivalent ions such as calcium, zinc, silver, iron or
other metal ions). Calcium binding sites have been
identified in calmodulin. Complexing agents of low molecular
weight can rapidly diffuse through the developing
cross-linked matrix. Additional cohesivity can be achieved by
inclusion of peptides containing complementary charges which
may form interstrand salt bridges. Additional adhesive and
cohesive stability may be achieved by inclusion of peptides
containing chemical cross-linking units such as those
containing cysteine, which can undergo disulfide bond
formation under mild oxidation conditions.

It is desirable for the tissue adhesive or sealant to
promote hemostasis. Amino acid sequences which promote the
adhesion of platelets will improve the hemostasis properties
of the product. The amino acid sequence of Peptide 93.3 is
known to bind to platelets. A platelet binding sequence has
also been identified in thrombospondin. The addition of
amino acid sequences containing the RGD cell attachment
domain of fibronectin will promote the attachment and
migration of many mammalian cells.

It is important for the tissue adhesive or sealant not
to interfere with the normal wound healing process. It
should be resorbed by the body as the healing process is
taking place. By using polymeric compositions having
differing numbers of amino acid sequences which are
proteolytic cleavage sites, the resorption rate of the
product can be adjusted. The sequences may be specifically
cleaved by one or more blood-borne or tissue-associated
proteases such as tissue plasminogen activator (tPA),
thrombin, and plasmin. By making the polymeric compositions
more or less susceptible to plasmin degradation, plasmin
inhibitors like aprotinin which delay the degradation of
fibrin are also unnecessary. Such inhibitors have raised
safety concerns.

To increase the adhesion to tissues, peptides from
human collagen, fibrinopeptides, collagen binding sequences
from fibronectin and other tissue binding proteins such as
collagenase, and peptides with binding activity to
saccharides or glycosaminoglycans and glycoproteins found in
tissues may be used.

Blood clotting will be prevalent in many applications
for a tissue adhesive or sealant. Therefore, improving the
product's ability to adhere to bloody tissues or tissues
which have progressed to early stages of fibrin clotting is
desirable. By way of example, PPAS1-B was modified to
include an amino acid sequence providing binding to fibrin
via a noncovalent affinity for fibrin alpha chain (see
PPAS1-D). Fibrin gamma chain residues 399-420 (POLSITE)
confer the ability to polymerize adjacent fibrin molecules
through their specific interaction with the amino terminal
residues of fibrin alpha chain.

The compositions have mechanical integrity and can be
used to form a variety of objects. The compositions have
mechanical strength which allows them to find numerous
applications for use internally in mammalian hosts. Rates
of cross-linking or setting-up can be controlled. The
compositions may be prepared in advance leaving out one
component, so that the composition may be activated by
adding the missing component, e.g. enzyme, small
cross-linking molecule, or polymer. The compositions are
physiologically acceptable, easily manipulated as solutions,
dispersions or powders, and can provide strong bonding. The
subject compositions afford the opportunity to introduce
additional functional capabilities into the compositions,
such as cell binding, chemoattractants, etc.

All publications and patent applications cited in this
specification are herein incorporated by reference as if
each individual publication or patent application were
specifically and individually indicated to be incorporated
by reference.

Although the foregoing invention has been described in
some detail by way of illustration and example for purposes
of clarity of understanding, it will be readily apparent to
those of ordinary skill in the art in light of the teachings
of this invention that certain changes and modifications may
be made thereto without departing from the spirit or scope
of the appended claims.



WO 95/23611                                   PCT/US95/02728



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Protein Polymer Technologies, Inc.

(ii) TITLE OF INVENTION: Products Comprising Substrates Capable
of Enzymatic Cross-Linking 

(iii) NUMBER OF SEQUENCES: 106 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Flehr, Hohbach, Test, Albritton & Herbert 

(B) STREET: Four Embarcadero Center, Suite 3400 

(C) CITY: San Francisco

(D) STATE: CA

(E) COUNTRY: US 

(F) ZIP: 94111 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US95/ 

(B) FILING DATE:

(C) CLASSIFICATION:



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/205,518 

(B) FILING DATE: 03-MAR-1994

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Rowland, Bertram I 

(B) REGISTRATION NUMBER: 20015 

(C) REFERENCE/DOCKET NUMBER: FP-58854-PC/BIR

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-781-1981 

(B) TELEFAX: 415-398-3249



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Val Pro Gly Val Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Gly Ala Gly Ala Gly Ser
1 5

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /note= "where Xaa is K or E"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Lys Leu Xaa Leu Ala Glu Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys Gln Ala Gly
1               5                   10                  15

Asp Val



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Val Leu Ser Leu Ser Gln Ser Lys Val Leu Pro Val Pro Glu
1               5                   10

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Gly Gly Ala Lys Gln Ala Gly Asp Val
1 5 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Gly Ala Gly Ala Gly Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Val Pro Gly Val Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Gly Ala Pro Gly Ala Pro Gly Ser Gln Gly Ala Pro Gly Leu Gln Gly
1               5                   10                  15

Ala Pro Gly Ala Pro Gly Ser Gln Gly Ala Pro Gly Leu Gln Tyr Met
            20                  25                  30

Lys



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Leu Ser Leu Ser Gln Ser Lys Val Leu Pro Val Pro Glu
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys Gln Ala Gly Asp

Val



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Arg Gln Ala Gly Asp

Val



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Gly Ala Lys Gln Ala Gly Asp Val
1 5 

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
GCTATGTTTA AACCACCTGT TCGCGATCCG GGTGCCGATC CAGGCCTGCG ATATCAGTAC 60
GTA 63

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TACGTACTGA TATCGCAGGC CTGGATCGGC ACCCGGATCG CGAACACGTG GTTTAAACAT 60
AGC 63
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGGCAGCGA AAGGGGACCG GTGCCCCGGG TACTCCTGGT CCACAAGGTC TGCCGGGAAG 60 
CCCAGGGGCT CCGGGTACTC CAGGTCCGCA AGGCCTGCCG GGTTCACCGG GTGCTCCGGG 120 
AACTCCTGGC CCGCAGGGCT TGCCGGGATC CCCAGGTGCA CCAGGAACGC CGGGACCTCA 180 
GGGTCTTCCG GGTAGCCCTC GTGCCTTTCC GCTAAAGTCC TGCCGT 226 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AAGAAGGAGA TATCATATGG CAGCGAAAGG GGACC 35 
(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
CGCACATCTT TAAATTACGG CAGGACTTTA GCGGAAA 37
(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 186 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGTCCCCCGG CTACTCCTGG TCCACAAGGT CTGCCGGGAA GCCCAGGGGC TCCGGGTACT 60 
CCAGGTCCGC AACGCCTGCC CCGTTCACCG GGTGCTCCGG GAACTCCTGG CCCGCAGGGC 120
TTGCCGGGAT CCCCAGGTGC ACCAGGAACG CCGGGACCTC AGGGTCTTCC GGGTACCCCT 180
GGTGCC 186

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
1 5 10 15

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
20 25 30

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
35 40 45

Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys Gln
50 55 60

Ala Gly Asp Val Gly Ser Pro
65 70

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
ATCCCAGCGA AAGGGGACCG TGCACCAGGA ACGCCGGGAG AAGGTCAACA GCACCATCTT 60 
GGTGGAGCGA AACAGGCAGG CGACGTCGGT AGCCCTGGTG CCTTTCCGCT AAAGTCCTGC 120 
CGT 123
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GCTGCCCCCG GTACTCCTGG TCCACAAGGT CTGCCGGGAA GCCCAGGGGC TCCGGGTACT 60 
CCAGGTCCGC AAGGCCTGCC GGGTTCACCG GGTGCTCCGG GAACTCCTGG CCCGCAGGGC 120 
TTGCCGGGAT CCCCAGGTGC ACCAGGAACG CCGGGAGAAG GTCAACAGCA CCATCTTGGT 180 
GGAGCGAAAC AGGCAGGCGA CGTCGGTAGC CCTGGTGCC 219 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

GATCTTCGAT CTCATCACCA TCACCATCAC TA 32

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
ACCTTAGTGA TGGTGATGGT GATGAGATCG AA 32
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Pro
20 25 30

Met Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
35 40 45

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
50 55 60

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
65 70 75 80

Pro Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys
85 90 95

Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro
100 105 110

Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln
115 120 125

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly
130 135 140

Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Glu Gly Gln Gln
145 150 155 160

His His Leu Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro Gly
165 170 175

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
180 185 190

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
195 200 205

Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly
210 215 220

Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys Gln Ala
225 230 235 240

Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly
245 250 255

Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu
260 265 270

Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro
275 280 285

Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Glu Gly Gln Gln His His
290 295 300

Leu Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro
305 310 315 320

Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly
325 330 335

Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr
340 345 350

Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro
355 360 365

Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys Gln Ala Gly Asp
370 375 380

Val Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro
385 390 395 400

Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly
405 410 415

Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser
420 425 430

Pro Gly Ala Pro Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly
435 440 445

Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr
450 455 460

Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro
465 470 475 480

Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
485 490 495

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Glu
500 505 510

Gly Gln Gln His His Leu Gly Gly Ala Lys Gln Ala Gly Asp Val Gly
515 520 525

Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser
530 535 540

Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
545 550 555 560

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
565 570 575

Ala Pro Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala
580 585 590

Lys Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
595 600 605

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro
610 615 620

Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln
625 630 635 640

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Glu Gly Gln
645 650 655

Gln His His Leu Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro
660 665 670

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
675 680 685

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
690 695 700

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
705 710 715 720

Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys Gln
725 730 735

Ala Gly Asp Val Gly Ser Pro Gly Ala Met Asp Pro Gly Arg Tyr Gln
740 745 750

Asp Leu Arg Ser His His His His His His
755 760

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
1 5 10 15

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
20 25 30

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
35 40 45

Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Arg Gln
50 55 60

Ala Gly Asp Val Gly Ser Pro
65 70

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
GTGGAGCTCG CCAGGCAGGC GACGT 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CGCCTGCCTG GCGAGCTCCA CGAAC 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
GGTCCCCCGG GTACTCCTGG TCCACAAGGT CTGCCGGGAA GCCCAGGGGC TCCGGGTACT 60 
CCAGGTCCGC AAGGCCTGCC GGGTTCACCG GGTGCTCCGG GAACTCCTGG CCCGCAGGGC 120
TTGCCGGGAT CCCCAGGTGC ACCAGGAACG CCGGGAGAAG GTCAACAGCA CCATCTTGGT 180 
GGAGCTCCCC AGGCAGGCCA CGTCGGTAGC CCTGGTGCC 219 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Pro
20 25 30

Met Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
35 40 45

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
50 55 60

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
65 70 75 80

Pro Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Arg
85 90 95

Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro
100 105 110

Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln
115 120 125

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly
130 135 140

Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Glu Gly Gln Gln
145 150 155 160

His His Leu Gly Gly Ala Arg Gln Ala Gly Asp Val Gly Ser Pro Gly
165 170 175

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
180 185 190

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
195 200 205

Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly
210 215 220

Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Arg Gln Ala
225 230 235 240

Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly
245 250 255

Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu
260 265 270

Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro
275 280 285

Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Glu Gly Gln Gln His His
290 295 300

Leu Gly Gly Ala Arg Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro
305 310 315 320

Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly
325 330 335

Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr
340 345 350

Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro
355 360 365

Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Arg Gln Ala Gly Asp
370 375 380

Val Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro
385 390 395 400

Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly
405 410 415

Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser
420 425 430

Pro Gly Ala Pro Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly
435 440 445

Gly Ala Arg Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr
450 455 460

Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro
465 470 475 480

Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
485 490 495

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Glu
500 505 510

Gly Gln Gln His His Leu Gly Gly Ala Arg Gln Ala Gly Asp Val Gly
515 520 525

Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser
530 535 540

Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
545 550 555 560

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
565 570 575

Ala Pro Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala
580 585 590

Arg Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
595 600 605

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro
610 615 620

Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln
625 630 635 640

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Glu Gly Gln
645 650 655

Gln His His Leu Gly Gly Ala Arg Gln Ala Gly Asp Val Gly Ser Pro
660 665 670

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
675 680 685

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
690 695 700

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
705 710 715 720

Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Arg Gln
725 730 735

Ala Gly Asp Val Gly Ser Pro Gly Ala Met Asp Pro Gly Arg Tyr Gln
740 745 750

Asp Leu Arg Ser His His His His His His
755 760

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
1 5 10 15

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
20 25 30

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
35 40 45

Gly Thr Pro Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro
50 55 60

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
TCCACCAGCA ACGCCGGGAG GTGCTAAACA AGCAGGAGAC GTCGGTAGCC CTGGTGCCTT 60
T 61

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
AAACGCACCA GGGCTACCGA CGTCTCCTGC TTGTTTAGCA CCTCCCGGCG TTCCTGG 57
(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GGTCCCCCGG GTACTCCTGG TCCACAAGGT CTGCCGGGAA GCCCAGGGGC TCCGGGTACT 60 
CCAGGTCCGC AAGGCCTGCC GGGTTCACCG GGTGCTCCGG GAACTCCTGG CCCGCAGGGC 120 
TTGCCGGGAT CCCCAGGTGC ACCAGGAACG CCGGCAGGTG CTAAACAAGC AGGAGACCTC 180
GGTAGCCCTG GTGCC 195

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 682 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Pro
20 25 30

Met Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
35 40 45

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
50 55 60

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
65 70 75 80

Pro Gly Thr Pro Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro
85 90 95

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
100 105 110

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
115 120 125

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
130 135 140

Gly Thr Pro Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro Gly
145 150 155 160

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
165 170 175

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
180 185 190

Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly
195 200 205

Thr Pro Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro Gly Ala
210 215 220

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
225 230 235 240

Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly
245 250 255

Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr
260 265 270

Pro Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro
275 280 285

Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly
290 295 300

Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr
305 310 315 320

Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro
325 330 335

Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly
340 345 350

Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr
355 360 365

Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro
370 375 380

Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
385 390 395 400

Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr
405 410 415

Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro
420 425 430

Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
435 440 445

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Gly
450 455 460

Ala Lys Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro
465 470 475 480

Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
485 490 495

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro
500 505 510

Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Gly Ala
515 520 525

Lys Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
530 535 540

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro
545 550 555 560

Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln
565 570 575

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Gly Ala Lys
580 585 590

Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro
595 600 605

Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln
610 615 620

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly
625 630 635 640

Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Gly Ala Lys Gln
645 650 655

Ala Gly Asp Val Gly Ser Pro Gly Ala Met Asp Pro Gly Arg Tyr Gln
660 665 670

Asp Leu Arg Ser His His His His His His
675 680

(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

Thr Arg Trp Tyr Ser Met Lys Lys Thr Thr Met Lys Ile Ile Pro Phe
1 5 10 15

Asn Arg Leu Thr Ile
20

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
1 5 10 15

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
20 25 30

Thr Arg Trp Tyr Ser Met Lys Lys Thr Thr Met Lys Ile Ile Pro Phe
35 40 45

Asn Arg Leu Thr Ile Gly Glu Gly Gln Gln His His Leu Gly Gly Ala
50 55 60

Lys Gln Ala Gly Asp Val Gly Ser Pro
65 70

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
ATGGCAGCGA AAGGGGACCA CCGGGTGCTA CCCGTTGGTA TTCTATGAAA AAGACTACCA 60
TGAAAATCAT TCCGTTTAAC CGCCTGACCA TTCGCGAAGG TCAACTTTCC GCTAAAGTCC 120
TGCCGT 126

(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 225 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:
GGTGCCCCGG GTACTCCTGG TCCACAAGGT CTGCCGGGAA GCCCAGGGGC TCCGGGTACT 60
CCAGGTCCGC AAGGCCTGCC GGGTTCACCG GGTGCTACCC GTTGGTATTC TATGAAAAAG 120
ACTACCATGA AAATCATTCC GTTTAACCGC CTGACCATTG GCGAAGGTCA ACAGCACCAT 180
CTTGGTGGAG CTCGCCAGGC AGGCGACGTC GGTAGCCCTG GTCCC 225
(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Pro
20 25 30

Met Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
35 40 45

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
50 55 60

Ala Thr Arg Trp Tyr Ser Met Lys Lys Thr Thr Met Lys Ile Ile Pro
65 70 75 80

Phe Asn Arg Leu Thr Ile Gly Glu Gly Gln Gln His His Leu Gly Gly
85 90 95

Ala Lys Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro
100 105 110

Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
115 120 125

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Thr Arg Trp Tyr Ser Met
130 135 140

Lys Lys Thr Thr Met Lys Ile Ile Pro Phe Asn Arg Leu Thr Ile Gly
145 150 155 160

Glu Gly Gln Gln His His Leu Gly Gly Ala Lys Gln Ala Gly Asp Val
165 170 175

Gly Ser Pro Gly Ala Met Asp Pro Gly Arg Tyr Gln Asp Leu Arg Ser
180 185 190

His His His His His His
195

(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Ala Met Phe Lys Pro Arg Val Arg Asp Pro Gly Ala Asp Pro Gly Leu
1 5 10 15

Arg Tyr Gln Tyr Val
20

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
1 5 10 15

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
20 25 30

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
35 40 45

Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
50 55 60

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
1 5 10 15

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
20 25 30

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
35 40 45

Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys Gln
50 55 60

Ala Gly Asp Val Gly Ser Pro Gly Ala
65 70

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
1 5 10 15

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
20 25 30

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
35 40 45

Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Arg Gln
50 55 60

Ala Gly Asp Val Gly Ser Pro Gly Ala
65 70

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
1 5 10 15

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
20 25 30

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
35 40 45

Gly Thr Pro Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser Pro Gly
50 55 60

Ala
65



(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
1 5 10 15

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
20 25 30

Thr Arg Trp Tyr Ser Met Lys Lys Thr Thr Met Lys Ile Ile Pro Phe
35 40 45

Asn Arg Leu Thr Ile Gly Glu Gly Gln Gln His His Leu Gly Gly Ala
50 55 60

Arg Gln Ala Gly Asp Val Gly Ser Pro Gly Ala
65 70 75



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Leu Gly Pro Gly Gln Ser Lys Val Ile Gly
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
Gly Glu Gly Gln Gln His His Leu Gly Gly



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 897 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Pro
20 25 30

Met Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
35 40 45

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
50 55 60

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
65 70 75 80

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
85 90 95

Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly
100 105 110

Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr
115 120 125

Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro
130 135 140

Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
145 150 155 160

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro
165 170 175

Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln
180 185 190

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly
195 200 205

Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu
210 215 220

Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro
225 230 235 240

Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly
245 250 255

Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser
260 265 270

Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
275 280 285

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
290 295 300

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
305 310 315 320

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
325 330 335

Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly 
340 345 350 

Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr 
355 360 365 

Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro 
370 375 380 

Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly 
385 390 395 400 

Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro 
405 410 415 

Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin 
420 425 430 

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly 
435 440 445 

Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu 
450 455 460 

Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro 
465 470 475 480 

Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly 
485 490 495 

Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser 
500 505 510 

Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro 
515 520 525 

Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly 
530 535 540 

Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala 
545 550 555 560 

Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala I'ro 
565 570 575 

Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly 
580 585 590 

Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr 
595 600 605 

Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro 
610 615 620 

Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
625 630 635 640

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro
645 650 655

Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin 
660 665 670 

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly 
675 680 685 
Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu
690 695 700 

Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro 
705 710 715 720 

Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly 
725 730 735 

Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser 
740 745 750 

Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
755 760 765 

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
770 775 780 

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
785 790 795 800

Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro 
805 810 815 

Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly
820 825 830 

Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr
835 840 845 

Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro
850 855 860 

Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Met Asp Pro Gly Arg
865 870 875 880

Tyr Gln Leu Ser Ala Gly Arg Tyr His Tyr Gln Leu Val Trp Cys Gln
885 890 895 

Lys 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 225 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
GGTGCCCCGG GTACTCCTGG TCCACAAGGT CTGCCGGGAA GCCCAGGGGC TCCGGGTACT 60 
CCAGGTCCGC AAGGCCTGCC GGGTTCACCG GGTGCTACCC GTTGGTATTC TATGAAAAAG 120 
ACTACCATGA AAATCATTCC GTTTAACCGC CTCACCATTG GCGAAGGTCA ACAGCACCAT 180 
CTTGGTGGAG CTCGCCAGGC AGGCGACGTC GGTAGCCCTG GTGCC 225 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Pro
20 25 30

Met Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
35 40 45

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
50 55 60

Ala Thr Arg Trp Tyr Ser Met Lys Lys Thr Thr Met Lys Ile Ile Pro
65 70 75 80

Phe Asn Arg Leu Thr Ile Gly Glu Gly Gln Gln His His Leu Gly Gly
85 90 95

Ala Arg Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro
100 105 110

Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
115 120 125

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Thr Arg Trp Tyr Ser Met
130 135 140

Lys Lys Thr Thr Met Lys Ile Ile Pro Phe Asn Arg Leu Thr Ile Gly
145 150 155 160

Glu Gly Gln Gln His His Leu Gly Gly Ala Arg Gln Ala Gly Asp Val
165 170 175

Gly Ser Pro Gly Ala Met Asp Pro Gly Arg Tyr Gln Asp Leu Arg Ser
180 185 190 

His His His His His His 
195 

(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Ala
20 25 30

Pro Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys
35 40 45

Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro
50 55 60

Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln
65 70 75 80

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly
85 90 95

Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu 
100 105 110 

Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro 
115 120 125

Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly
130 135 140 

Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser
145 150 155 160

Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro
165 170 175

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
180 185 190

Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala 
195 200 205 

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
210 215 220

Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly
225 230 235 240 

Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr 
245 250 255 

Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro 
260 265 270 

Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly
275 280 285

Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro 
290 295 300 

Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin 
305 310 315 320 

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly
325 330 335

Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu 
340 345 350 

Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro 
355 360 365 

Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly
370 375 380 

Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser 
385 390 395 400

Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro 
405 410 415 

Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly
420 425 430

Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala
435 440 445 

Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro
450 455 460 

Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly 
465 470 475 480

Thr Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr
485 490 495 

Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro 
500 505 510 

Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly 
515 520 525 

Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro 
530 535 540 

Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin 
545 550 555 560

Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly 
565 570 575 

Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu
580 585 590 

Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro 
595 600 605 

Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Leu Pro Gly
610 615 620 

Ser Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser 
625 630 635 640 

Pro Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro 
645 650 655 

Gly Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly 
660 665 670 

Ala Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala 
675 680 685

Pro Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro 
690 695 700 

Gly Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly 
705 710 715 720 

Thr Pro Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr 
725 730 735 

Pro Gly Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro
740 745 750 

Gly Pro Gin Gly Leu Pro Gly Ser Pro Gly Ala Pro Gly Thr Pro Gly 
755 760 765 

Pro Gln Gly Leu Pro Gly Ser Pro Gly Ala Met Asp Pro Gly Arg Tyr
770 775 780

His Met Ala Ala Lys Gly Asp Arg Ala Pro Gly Thr Pro Gly Glu Gly
785 790 795 800

Gln Gln His His Leu Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser
805 810 815

Pro Asp Gln Asp Leu Arg Ser His His His His His His
820 825 

(2) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 877 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Ala
20 25 30

Pro Gly Thr Pro Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys
35 40 45

Gln Ala Gly Asp Val Gly Ser Pro Gly Ala Gly Ser Gly Ala Gly Ala
50 55 60

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
65 70 75 80

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
85 90 95

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser
100 105 110

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala
115 120 125

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
130 135 140

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
145 150 155 160 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser 
165 170 175 

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala 
180 185 190 

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
195 200 205

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
210 215 220

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser
225 230 235 240

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala
245 250 255

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
260 265 270

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
275 280 285

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser
290 295 300 

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala 
305 310 315 320 

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
325 330 335

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
340 345 350

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser
355 360 365

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala
370 375 380

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
385 390 395 400

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro 
405 410 415 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser
420 425 430

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala
435 440 445 

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val 
450 455 460 

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
465 470 475 480

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser
485 490 495

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala
500 505 510

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
515 520 525 

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro 
530 535 540 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser
545 550 555 560 

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala 
565 570 575 

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
580 585 590

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
595 600 605

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser
610 615 620

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala
625 630 635 640

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
645 650 655

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
660 665 670 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser 
675 680 685 

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala 
690 695 700 

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
705 710 715 720

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
725 730 735

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser
740 745 750

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala
755 760 765

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
770 775 780

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
785 790 795 800

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser
805 810 815

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Met Asp Pro Gly Arg Tyr
820 825 830

His Met Ala Ala Lys Gly Asp Arg Ala Pro Gly Thr Pro Gly Glu Gly
835 840 845

Gln Gln His His Leu Gly Gly Ala Lys Gln Ala Gly Asp Val Gly Ser
850 855 860

Pro Asp Gln Asp Leu Arg Ser His His His His His His
865 870 875 

(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala 
1 5 10 15 

Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val Gly
20 25 30

Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Lys Gly Val 
35 40 45 

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
50 55 60

(2) INFORMATION FOR SEQ ID NO: 57:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala
1 5 10 15

Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val Gly
20 25 30

Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Glu Gly Val
35 40 45 

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 
CTGGAGCGGG TGCCTGCATG TACATCCGAG T 31
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 
CCGAGACCTC GCCCACGGAC GTACATGTAG GCTCA 35 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
GGTGCCGGTT CTGGAGCTGG CGCCGGCTCT GGAGTAGGTG TGCCAGGTGT AGGAGTTCCG 60 
GGTGTAGGCG TTCCGGGAGT TGGTGTACCT GGACTGGCTG TTCCAGGCGT AGGTGTGCCC 120
GGGGTAGGAG TACCAGGGGT AGGCGTCCCT GGAGCGGGTG CTGGTAGCGG CGCAGGCGCG 180
GCCTCTGGAG CG 192

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:

Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly
1 5 10 15 

Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val 
20 25 30 

Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly 
35 40 45 

Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala
50 55 60

(2) INFORMATION FOR SEQ ID NO: 62:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
ATGGCAGCGA AAGGGGACCG GGCTCTGGTG TTGGAGTGCC AGGTGTCGGT GTTCCGCGTG 60 
TAGGCGTTCC GGGAGTTGGT GTACCTGGAA AGGTGTTCCG GGGGTAGGTG TGCCGGGCGT 120
TGGAGTACCA GGTGTAGGCG TCCCGGGAGC GGGTGCTGGT AGCGGCGCAG GCGCGGGCTC 180 
TTTCCGCTAA AGTCCTGCCG T 201
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

AAGAAGGAGA TATCATATGG CAGCGAAAGG GGACC 35
(2) INFORMATION FOR SEQ ID NO: 64: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
CGCAGATCTT TAAATTACGG CAGGACTTTA CCGGAAA 37 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GGTGCCCGTT CTGGAGCTGG CGCGGGCTCT GGTGTTGGAG TGCCAGGTGT CGGTGTTCCG 60 
GCTCTAGCCG TTCCGGGAGT TGGTGTACCT GGAAAAGGTG TTCCGGGGGT AGGTCTGCCG 120 
GGCGTTGGAG TACCAGGTGT AGGCGTCCCG GGAGCGGGTG CTGGTAGCGG CGCAGGCGCG 180 
CGCTCTGGAG CG 192

(2) INFORMATION FOR SEQ ID NO: 66:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly 
1 5 10 15

Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Lys 
20 25 30 

Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly 
35 40 45 

Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
GGTGCCGGTT CTGGAGCTGG CGCGGGCTCT GGTGTTGGAG TGCCAGGTGT CGGTGTTCCG 60
GGTGTAGGCG TTCCGGGAGT TGGTGTACCT GGAGAAGGTG TTCCGGGGGT AGGTGTGCCG 120
GGCGTTGGAG TACCACGTGT AGCCGTCCCG GGAGCGGGTG CTGGTAGCGG CGCAGGCGCG 180
GGCTCTGGAG CG 192

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly 
1 5 10 15

Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Glu
20 25 30 

Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly 
35 40 45 

Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 864 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Pro
20 25 30 

Met Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro 
35 40 45 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly 
50 55 60 

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val 
65 70 75 80 

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly 
85 90 95 

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro 
100 105 110 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
115 120 125

WO 95/23611 PCT/US95/02728

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
130 135 140

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
145 150 155 160

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro
165 170 175

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
180 185 190

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
195 200 205

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
210 215 220 

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro 
225 230 235 240 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly 
245 250 255 

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
260 265 270

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
275 280 285 

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro 
290 295 300 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
305 310 315 320

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
325 330 335

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
340 345 350

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro
355 360 365

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
370 375 380

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
385 390 395 400

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
405 410 415

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro
420 425 430

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
435 440 445

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
450 455 460 

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly 
465 470 475 480 

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro 
485 490 495 
Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
500 505 510 

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val 
515 520 525 

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly 
530 535 540 

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro 
545 550 555 560 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
565 570 575

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
580 585 590

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
595 600 605 

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro 
610 615 620 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
625 630 635 640

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
645 650 655

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
660 665 670

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro
675 680 685 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly 
690 695 700 

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
705 710 715 720

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
725 730 735

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro
740 745 750

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
755 760 765

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
770 775 780

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
785 790 795 800

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro
805 810 815

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
820 825 830

Lys Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
835 840 845

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
850 855 860

Ala Gly Ala Met Asp Pro Gly Arg Tyr Gln Asp Leu Arg Ser His His
865 870 875 880

His His His His



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

Leu Gly Pro Gly Gln Ser Lys Val Ile Gly
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

Leu Gly Pro Gly Gln Ser Lys Val Ile Gly
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Gly Gly Leu Lys Gly Gly Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

Gly Gly Leu Lys Gly Gly Gly 
1 5 
(2) INFORMATION FOR SEQ ID NO: 74:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Pro
20 25 30

Met Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro
35 40 45 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly 
50 55 60 

Gly Gly Leu Lys Gly Gly Gly Gly Val Pro Gly Val Gly Val Pro Gly 
65 70 75 80

Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser Gly
85 90 95 

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly 
100 105 110 

Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro 
115 120 125 

Gly Val Gly Val Pro Gly Gly Gly Leu Lys Gly Gly Gly Gly Val Pro 
130 135 140 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
145 150 155 160

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly 
165 170 175 

Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val Gly Val
180 185 190

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Gly Gly Leu Lys
195 200 205

Gly Gly Gly Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
210 215 220

Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly
225 230 235 240

Ser Gly Ala Gly Ala Met Asp Pro Gly Arg Tyr Gln Asp Leu Arg Ser
245 250 255 

His His His His His His 
260 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Pro
20 25 30 

Met Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro 
35 40 45 

Gly Val Gly Val Pro Gly Leu Gly Pro Gly Gln Ser Lys Val Ile Gly
50 55 60 

Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly 
65 70 75 80 

Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala 
85 90 95 

Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly 
100 105 110 

Val Gly Val Pro Gly Leu Gly Pro Gly Gln Ser Lys Val Ile Gly Gly
115 120 125 

Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val 
130 135 140 

Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly 
145 150 155 160 

Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val 
165 170 175 

Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Leu Gly
180 185 190

Pro Gly Gln Ser Lys Val Ile Gly Gly Val Pro Gly Val Gly Val Pro
195 200 205

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser
210 215 220

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Met Asp Pro Gly Arg Tyr
225 230 235 240

Gln Asp Leu Arg Ser His His His His His His
245 250 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Ala
20 25 30

Pro Gly Thr Pro Gly Gly Leu Lys Gly Gly Gly Gly Ser Pro Gly Ala
35 40 45

Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val Gly
50 55 60

Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val
65 70 75 80

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro
85 90 95

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala
100 105 110 

Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val Gly 
115 120 125 

Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val 
130 135 140 

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro 
145 150 155 160 

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala 
165 170 175 

Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val Gly 
180 185 190 

Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val 
195 200 205 

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro 
210 215 220 

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala 
225 230 235 240 

Met Asp Pro Gly Arg Tyr His Met Ala Ala Lys Gly Asp Arg Ala Pro 
245 250 255 

Gly Thr Pro Gly Gly Leu Lys Gly Gly Gly Gly Ser Pro Asp Gln Asp
260 265 270

Leu Arg Ser His His His His His His
275 280

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val
1 5 10 15

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Ala
20 25 30

Pro Gly Thr Pro Leu Gly Pro Gly Gln Ser Lys Val Ile Gly Gly Ser
35 40 45 

Pro Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro 
50 55 60 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly 
65 70 75 80 

Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val 
85 90 95 

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
100 105 110

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro
115 120 125

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
130 135 140 

Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val 
145 150 155 160 

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly 
165 170 175 

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro
180 185 190

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly
195 200 205

Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val
210 215 220

Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly
225 230 235 240

Ala Gly Ala Met Asp Pro Gly Arg Tyr His Met Ala Ala Lys Gly Asp
245 250 255

Arg Ala Pro Gly Thr Pro Leu Gly Pro Gly Gln Ser Lys Val Ile Gly
260 265 270

Gly Ser Pro Asp Gln Asp Leu Arg Ser His His His His His His
275 280 285

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

Met Asp Pro Val Val Leu Gin Arg Arg Asp Trp Glu Asn Pro Gly Val 
1 5 10 15 

Thr Gin Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Pro 
20 25 30 
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Met Gly Ala Gly Ser Cly Ala Gly Ala Gly Ser Gly Val Gly Val Pro 
35 40 45 

Glv Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly 
' 50 55 60 

Leu Gly Pro Gly Gin Ser Lys Val He Gly Gly Val Pro Gly Val Gly 
65 70 75 BO 

Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala 
85 90 95 

Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala 
100 105 110 

Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val 
115 120 125 

Gly Val Pro Gly Val Gly Val Pro Gly Gly Gly Leu Lys Gly Gly Gly 
130 135 140 

Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly 
145 150 155 160 

Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala 
165 170 175 

Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly 
180 185 190 

Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Leu 
195 200 205 

Gly Pro Gly Gln Ser Lys Val Ile Gly Gly Val Pro Gly Val Gly Val 
210 215 220 

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly 
225 230 235 240 

Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly 
245 250 255 

Ala Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly 
260 265 270 

Val Pro Gly Val Gly Val Pro Gly Gly Gly Leu Lys Gly Gly Gly Gly 
275 280 285 

Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val 
290 295 300 

Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly 
305 310 315 320 

Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val 
325 330 335 

Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Leu Gly 
340 345 350 

Pro Gly Gln Ser Lys Val Ile Gly Gly Val Pro Gly Val Gly Val Pro 
355 360 365 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser 
370 375 380 

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala 
385 390 395 400 

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val 
405 410 415 

Pro Gly Val Gly Val Pro Gly Gly Gly Leu Lys Gly Gly Gly Gly Val 
420 425 430 

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro 
435 440 445 

Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly 
450 455 460 

Ser Gly Ala Gly Ala Met Asp Pro Gly Arg Tyr Gln Asp Leu Arg Ser 
465 470 475 480 

His His His His His His 
485 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Met Asp Pro Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val 
1 5 10 15 

Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Asp Pro 
20 25 30 

Met Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro 
35 40 45 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly 
50 55 60 

Leu Gly Pro Gly Gln Ser Lys Val Ile Gly Gly Val Pro Gly Val Gly 
65 70 75 80 

Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala 
85 90 95 

Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala 
100 105 110 

Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val 
115 120 125 

Gly Val Pro Gly Val Gly Val Pro Gly Gly Leu Lys Gly Gly Gly Gly 
130 135 140 

Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val 
145 150 155 160 

Pro Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly 
165 170 175 

Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val 
180 185 190 

Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Leu Gly 
195 200 205 

Pro Gly Gln Ser Lys Val Ile Gly Gly Val Pro Gly Val Gly Val Pro 
210 215 220 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser 
225 230 235 240 

Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala 
245 250 255 

Gly Ser Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val 
260 265 270 

Pro Gly Val Gly Val Pro Gly Gly Leu Lys Gly Gly Gly Gly Val Pro 
275 280 285 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly 
290 295 300 

Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly 
305 310 315 320 

Ser Gly Ala Gly Ala Gly Ser Gly Val Gly Val Pro Gly Val Gly Val 
325 330 335 

Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Leu Gly Pro Gly 
340 345 350 

Gln Ser Lys Val Ile Gly Gly Val Pro Gly Val Gly Val Pro Gly Val 
355 360 365 

Gly Val Pro Gly Val Gly Val Pro Gly Ala Gly Ala Gly Ser Gly Ala 
370 375 380 

Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser 
385 390 395 400 

Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly 
405 410 415 

Val Gly Val Pro Gly Gly Gly Leu Lys Gly Gly Gly Gly Val Pro Gly 
420 425 430 

Val Gly Val Pro Gly Val Gly Val Pro Gly Val Gly Val Pro Gly Ala 
435 440 445 

Gly Ala Gly Ser Gly Ala Gly Ala Gly Ser Gly Ala Gly Ala Met Asp 
450 455 460 

Pro Gly Arg Tyr Gln Asp Leu Arg Ser His His His His His His 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Gly Gln Gln His His Leu Gly Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81: 

Ser Val Leu Ser Leu Ser Gln Ser Lys Val Leu Pro Val Pro Glu 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Ser Val Leu Ser Leu Ser Gln Ser Arg Val Leu Pro Val Pro Glu 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 

Cys Val Leu Ser Leu Ser Gln Ser Arg Val Leu Val Pro Glu Cys 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Ser Val Leu Ser Gly Ser Gln Ser Lys Val Leu Pro Val Pro Glu 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Arg Arg Ser Lys His Ile Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO:86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 

Cys Leu Gly Pro Gly Gln Ser Lys Val Ile Gly Cys 
1 5 10 
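
The listings in this section use three-letter residue codes, whereas the claims cite one-letter strings such as LGPGQSKVIG. As a minimal illustrative sketch (not part of the original disclosure), the Python below converts a three-letter listing to one-letter code and compares it with the declared LENGTH. The names `THREE_TO_ONE` and `to_one_letter` are hypothetical helpers; the residue strings are transcribed from SEQ ID NO:80 and SEQ ID NO:86, assuming the standard abbreviations Gln and Ile.

```python
# Standard three-letter to one-letter amino acid codes (20 common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(listing: str) -> str:
    """Convert a whitespace-separated three-letter listing to one-letter code."""
    return "".join(THREE_TO_ONE[tok.capitalize()] for tok in listing.split())


# Residues transcribed from the listings (hypothetical variable names).
SEQ_80 = "Gly Gln Gln His His Leu Gly Gly"                  # declared LENGTH: 8
SEQ_86 = "Cys Leu Gly Pro Gly Gln Ser Lys Val Ile Gly Cys"  # declared LENGTH: 12

seq80 = to_one_letter(SEQ_80)
seq86 = to_one_letter(SEQ_86)

print(seq80, len(seq80))       # one-letter string and residue count
print(seq86, len(seq86))
print("LGPGQSKVIG" in seq86)   # does SEQ ID NO:86 contain the cited string?
```

An unrecognized token, such as a code damaged in transcription, raises a `KeyError`, which makes listing errors easy to spot.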

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Ala Lys Lys Lys Arg Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Leu Ser Leu Ser Gln Ser Lys Val Leu Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 

Ala His Lys Lys Gly Gln 
1 5 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: 

Leu Gly Pro Gly Gln Ser Arg Val Ile Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO:91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Tyr Leu Lys Asp Gln Gln 
1 5 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Leu Gly Pro Gly Gln His Lys Val Ile Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO:93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Pro Thr Thr Lys Met Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 

Leu Gly Pro Gly Gln His Arg Val Ile Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO:95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 

Arg Leu Asp His Lys Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Cys Leu Gly Pro Gly Gln Ser Arg Val Ile Gly Cys 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Val Gly Ser Asn Lys Gly Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: 

Gly Gly Pro Gly Gln Ser Lys Val Ile Gly Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 99: 



WO 95/23611 PCT/US95/02728 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Asn Gln Glu Gln Val Ser Pro Leu Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Thr Asp Met Pro Gln Met Arg Met Gln Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

Cys Gly Gln Ser Lys Val Ile Cys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Lys Val Leu Pro Ile Pro Gln Gln Val Val Pro Tyr 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Arg Ala Val Pro Val Gln Ala Leu Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Leu Asn Gln Glu Leu Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

Thr Val Gln Gln Glu Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

Val His His Gln Lys Leu Val 
1 5 

WHAT IS CLAIMED IS: 

1. A recombinant polymer of at least 15 kD comprised of 
at least two spaced apart enzyme recognition sequences 
comprising a glutamine capable of enzyme catalyzed 
isopeptide formation. 

2. A recombinant polymer according to Claim 1, 
comprising at least three of said enzyme recognition 
sequences, with the same intervening sequence between said 
enzyme recognition sequences. 

3. A recombinant polymer according to Claim 2, wherein 
said intervening sequences comprise a naturally occurring 
functional sequence. 

4. A recombinant polymer according to Claim 3, wherein 
said naturally occurring sequence is from the fibrin gamma 
chain. 

5. A recombinant polymer according to Claim wherein 
said intervening sequence comprises at least two 
repetitive units of from 3 to 18 amino acids. 

6. A recombinant polymer according to Claim 5, wherein 
said repetitive units have the collagen motif of every 
third amino acid being glycine. 

7. A recombinant polymer according to Claim 5, wherein 
said repetitive units consist of at least one of fibroin 
or elastin repetitive units. 

8. A recombinant polymer of at least 35 kD comprising 
alternating sequences comprising repetitive units having 
the collagen motif of every third amino acid being glycine 
and enzyme recognition sequences of at least about 3 amino 
acids and not more than about 60 amino acids and 
comprising a glutamine and/or lysine capable of enzyme 
catalyzed isopeptide formation. 

9. A recombinant polymer according to Claim 7, wherein 
said enzyme recognition sequences further comprise a 
fibrin gamma polsite. 

10. A recombinant polymer of at least 15 kD comprising 
repetitive units of from 3 to 8 amino acids and at least 2 
pendent groups comprising a glutamine and/or lysine 
capable of enzyme catalyzed isopeptide formation. 

11. A recombinant polymer according to Claim 10, wherein 
said polymer is collagen and said pendent groups are 
consensus sequences from casein or fibrin, said consensus 
sequences having the reactive lysine substituted with 
another amino acid. 

12. A recombinant polymer according to Claim 10, wherein 
said polymer is at least 35 kD. 

13. A recombinant polymer according to Claim 10, wherein 
said repetitive units have the collagen motif of every 
third amino acid being glycine. 

14. A recombinant polymer according to Claim 10, wherein 
said repetitive units consist of at least one of fibroin 
or elastin repetitive units. 

15. A recombinant polymer according to Claim 10, wherein 
said pendent groups comprise the sequence LGPGQSKVIG. 

16. A recombinant polymer of at least 35 kD comprising 
repetitive units of from 3 to 8 amino acids and at least 2 
pendent groups comprising a lysine capable of enzyme 
catalyzed isopeptide formation. 

17. A recombinant polymer according to Claim 16, wherein 
said pendent groups comprise the sequence GGLKGGG. 

18. A composition comprising a recombinant polymer 
according to Claim 1 and a cross-linking compound other 
than said recombinant polymer. 

19. A composition according to Claim 18, wherein said 
cross-linking compound is a protein polymer comprising a 
plurality of lysines. 

20. A composition according to Claim 18, wherein said 
cross-linking compound is a molecule of less than 5 kD 
comprising at least two primary amino groups. 

21. A composition comprising a recombinant polymer 
according to Claim 1 and Factor XIII or Factor XIIIa. 

22. An implantable sterilized composition comprising a 
polymer according to Claim 1. 

23. An article of manufacture comprising a polymer 
according to Claim 1 cross-linked by an enzyme catalyzed 
reaction. 
